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Chapter  1 

INTRODUCTION  TO  FINAL  REPORT 


This  research  focuses  on  the  issues  relavant  to 
designing  practical  high  performance  CMOS  ULSI  circuits 
utilizing  the  1.2  micron  CMOS  technology  obtained  from 
MOSIS.  As  such  the  research  has  supported  the  following 
ac  t i v i ties: 


1)  the  vendor  interface  for  MOSIS, 

2)  the  development  of  design  rules, 

3)  the  characterization  of  the  vendor  processing, 

4)  the  development  of  the  CAD  tools  for  the  designers, 

5)  the  development  of  circuit  methodologies  appropriate 

for  CMOS  m i cr ocompu ter s ,  and 

6)  the  development  of  canaries,  experimental  circuits 

which  demonstrate  the  circuit  and  design 
me  thodol og i es . 


Much  of  the  information  relative  to  the  vendor  interface  and 
the  individual  vendor  process  characteristics  is  company 
proprietary.  After  the  envelope  of  process  char ac ter i st i cs 
from  the  various  vendors  is  developed  by  MOSIS,  this 
appropriate  information  is  provided  to  the  community 
directly  by  MOSIS.  Consequently,  this  report  will  address 
the  research  relative  to  CMOS  circuit  development. 


The  concept  of  scalable  CMOS  design  rules  to  facilitate 
fabrication  of  the  design  at  current  production  technologies 
down  to  advanced  UHSIC  geometries  is  developed  in  the  next 
chapter.  These  design  rules  are  applicable  for  service 
through  the  MOSIS  system  with  MOSIS  providing  the  processing 
of  the  design  file  to  translate  to  the  actual  mask 
dimensions  used  in  the  pattern  generator.  The  intent  is  for 
MOSIS  to  provide,  in  the  future  char ac ter i zat i on  data,  for 
the  different  technologies  supported.  This  characterization 
is  to  be  developed  under  other  funding  and  implemented  in 
both  the  absolute  and  lambda-based  terms. 


The  design  rules  are  presented  in  the  "user-friendly" 
version  distributed  by  MOSIS  and  in  the  compact  style  with 
the  extensive  calculations  as  developed  at  MSU  under  this 
contract. 


Previous  research  at  MSU  has  dealt  with  clocked  CMOS 
circuit  styles  with  some  application  to  gate  array  and 
m  i  cr opr ocessor  applications.  Work  under  this  contract  has 
allowed  these  concepts  to  be  developed  in  the  full  custom 
environment,  but  focuses  on  the  requirements  in  implementing 
custom  microprocessors. 


The  -first  application  area  presented  in  this  report 
deals  with  structured  logic  schemes  based  on  Programmable 
Logic  Arrays  <PLAs>.  Three  di-f-ferent  PLA  design  methods  are 
reported  with  a  comparison  o-f  the  d  i  f  -f  erences .  In 
addition,  a  structured,  but  flexible,  version  of  the  PLA,  in 
the  form  of  the  PPL  < Progr ammabl e  Path  Logic)  is  developed. 
Appreciation  is  expressed  to  Kent  Smith,  et  al  ,  of  the 
University  of  Utah  for  their  cooperation  in  the  PPL 
research.  The  MSU  version  utilizes  clocked  CMOS  circuitry 
in  its  implementation,  although  a  ripple  signal  is 
propagated  through  the  logic  between  the  state  latches. 
This  clocked  version  is  appr ox i mate  1 y  four  times  smaller 
than  that  of  the  static  CMOS  version  first  obtained  at  Utah. 
The  structured  logic  circuit  development  is  aimed  at  an 
efficent  method  of  generating  the  "glue"  logic  around  the 
other  major  functional  blocks  used  in  the  design  of 
m i cr ocompu  ters . 

The  issues  relative  to  the  design  of  a  CMOS  data  path 
are  dealt  with  in  the  next  part  of  the  report.  Clocked  CMOS 
circuitry  is  developed  with  material  presented  in  a  tutorial 
form  for  students.  The  concept  of  a  N-mostly  design  is 
developed  in  comparison  to  a  more  conventional  CMOS  circuit 
style.  The  N-mostly  circuit  approach  utilizes  PMOS  for 
latches,  buffers,  and  precharge  devices  with  transfer  gates 
and  most  logic  circuitry  implemented  in  NMOS.  The  N-mostly 
approach  saves  considerable  area  in  the  circuitry  associated 
with  transferring  data  over  the  bus,  through  the  shifter, 
and  into  the  ALU  input  latches.  The  full  data  path  canary, 
developed  under  this  research  effort,  is  implemented  using 
single  layer  metal  and  conventional  CMOS  transfer  gates.  It 
is  designed  with  the  original  1.2  micron  design  rules  which 
are  similar  but  not  compatible  with  the  scalable  set. 
Another  experimental  circuit  has  been  designed  to  test  the 
N-mostly  approach  to  the  bus  transfers. 

The  data  path  report  also  includes  the  development  of  a 
pad  frame  with  I/O  circuitry  to  be  used  by  other  designer? 
with  the  1.2  micron  technology.  Although  this  work  preceded 
the  released  MOSIS  version  of  the  scalable  design  rules  and 
is  not  “scalable",  present  work  with  MOSIS  is  leading  to  a 
redesigned  padframe  which  is  compatible  with  the  scalable 
concept.  The  circuit  style  in  this  report  is  carried  over 
to  the  scalable  version  to  be  released  in  the  future  by 
MOSIS. 

A  m i crosequencer  designed  for  CMOS  is  developed  in  the 
next  part  of  the  report.  This  microsequencer  is  an 
enhancement  of  the  Mead  and  Conway  version  and  is  similar  to 
the  AMD  2900  series.  An  experimental  canary  based  on  this 
research  is  being  reworked  to  incorporate  the  new  scalable 
design  rules,  again  under  separate  funding.  This 
m i crosequencer  is  expected  to  become  the  heart  of  a  control 


unit  to  be  implemented  in  the  scalable  rules.  Our  plans  at 
MSU  are  to  incorporate  in  the  -future  this  basic 
microsequencer  with  two  small  ALUs  -for  control  of  external 
memory  and  internal  data  memory  to  form  a  fairly  powerful 
control  unit,  except  for  the  programmable  control  store. 

The  final  part  of  the  report  deals  with  the  development 
of  a  tester,  suitable  for  university  researchers  to  evaluate 
their  experimental  canaries.  The  tester  allows  the 
downloading  of  test  vectors  from  a  HP9820  through  a  16  bit 
port  into  dedicated  memory  for  testing.  This  memory,  which 
is  modular  in  construction  with  8  channels  per  module, 
facilitates  testing  at  speeds  up  to  a  10  MHz  clock  rate. 
The  device  under  test  <DUT)  outputs  are  loaded  back  into  the 
tester  memory  in  real  time  and  evaluated  off-line  after 
being  uploaded  into  the  HP9820 .  Fairly  simple  control  of 
the  logic  levels  at  the  DUT  interface  are  provided.  This 
arrangement  minimizes  the  complexity  of  the  circuitry 
involved  but  is  suitable  for  university  research. 


Chapter  2 

SCALABLE  CMOS  DESIGN  RULES 


2.1  Objective  of  Scalable  Rules 

The  objective  of  developing  scalable  CMOS  design  rules 
is  two  fold:  1)  to  facilitate  translating  a  design  data 
base  from  one  CMOS  technology  to  another  technology 
tr ansparan t 1 y ,  and  2)  to  fabricate  more  economically  student 
projects  or  other  experimental  designs,  first,  at  the  3 
micron  technology  and,  later,  at  the  1.2  micron  technology. 

The  MOSIS  system  of  supplying  fabrication  service 
primarily  achieves  its  cost  effectiveness  by  spreading  the 
fabrication  cost  over  many  different  designs.  The 
fabrication  masks  contain,  maybe,  50  to  200  different 
designs  for  each  wafer.  The  tooling  (masks)  and  wafer 
fabrication  cost  of  approximately  $25,000  is  spread  over  the 
50  or  more  designs  to  achieve  less  than  $500  fab  cost  for 
each  individual  design.  Utilizing  full  wafer  exposure 
allows  the  entire  wafer  area  to  be  unique.  In  contrast, 
standard  production  procedures  commonly  utilize  several 
hundred  patterns  which  are  repeats  of  a  single  design. 

At  the  1.2  micron  technology,  full  wafer  exposure  is 
prohibitively  expensive  with  direct  e-beam  or  X-ray 
1  i  thography  being  the  candidates.  For  production 
requirement  reasons,  the  silicon  foundries  emphasize  wafer 
steppers  which  provide  only  approximately  one  square 
centimeter  of  unique  pattern  area,  which  is  repeated  across 
the  wafer.  This  limited  pattern  area  limits  the  number  of 
designs  which  can  absorb  the  rather  high  fabrication  cost. 
The  results  is  that  prototyping  at  the  1.2  micron  level  will 
result  in  fabrication  cost  at  least  lOx  over  the  3  micron 
fab  cost. 

At  least  for  the  learning  exercises  and  initial  debug 
phases  of  design,  it  is  desirable  for  a  designer  to  be  able 
to  experiment  at  the  more  cost  effective  CMOS  technology  and 
proceed  to  the  high  performance  technology  as  justified. 
Scalable  design  rules  permit  the  regeneration  of  the 
required  artwork  without  extensive  designer  rework. 


2.2  The  Approach  with  MOSIS 

The  concept  of  the  scalable  rules  evolved  from  our  work 
at  Mississippi  State  University  to  facilitate  both  the 
teaching  of  CMOS  circuit  design  and  the  research  effort 
relevant  to  the  1.2  micron  program.  In  order  to  generate 
experimental  structures  with  various  design  rules,  we 
developed  our  own  software  GRAIL  with  a  technology  table  of 
feature  sizes  which  are  pointed  to  from  the  design  data 
base.  By  changing  the  scale  factor  of  the  grid  system  and 
the  technology  table,  a  design  can  be  run  at  different 
technologies.  The  feature  sizes  are  only  loosely  tied  to 
the  scaling  factor,  thereby,  permitting  non-linear  scaling 
and  compensations  as  needed  for  the  particular  technology 
and  particular  vendor.  GRAIL  provides  the  flexibility  of  a 
symbolic  design  system  with  the  readabi 1 i ty  of  conventional 
mask  drawings.  This  system  is  unique  to  MSU ,  however,  and 
is  not  easily  transported. 

MOSIS  requested  a  scheme  <system)  with  the  following 
proper  ties: 

1)  It  should  not  require  any  additional  software  by  the 
designer  —  thus  ruling  out  symbolic  methods. 

2)  It  should  be  as  simple  as  possible  for  the 
convenience  of  the  design  community,  but  compatible 
with  all  of  the  existing  design  methods  used  by  the 
MOSIS  community. 

MOSIS  would  accept  a  rather  course  grid  design  which 
more-or-less  represented  the  design  (symbolic?);  MOSIS  would 
then  "operate"  on  the  design  file  with  MOSIS  software  which 
would  provide  the  translation  to  high  resolution  masks  for  a 
particular  technology.  Simple! 

2.3  The  Challenges  and  Issues 

Reviewing  the  different  design  methods  used  in  the 
community  and  certain  constraints  from  the  MOSIS  environment 
revealed  very  interesting  trade-offs,  too  many  to  elaborate 
on.  However,  the  general  problem  is  worthy  of  discussion. 

First,  in  spite  of  what  is  drawn  (or  entered  in  the 
GIF  design  file  which  is  submitted  to  MOSIS)  with  a  supposed 
resolution  of  .01  microns,  the  masks  are  generated  with 
feature  edges  falling  on  increments  (drawing  units)  of  a 
much  lower  resolution.  This  drawing  unit  corresponds  to  the 
"spot"  size  of  the  electron-beam  used  in  the  pattern 
generation  process.  The  first  trade-off  involves  the  size 
of  the  beam  (spot  size).  The  smaller  the  spot,  the  finer 
the  grid  which  defines  the  edges  of  all  of  the  graphic 
features  and  the  higher  the  mask  generation  costs.  Reducing 
the  spot  by  a  factor  of  two  results  in  four  times  the  masks' 


cost.  The  MOSIS  constraint:  use  as  large  a  spot  size  as 
poss i bl e . 

The  largest  spot  size  available  is  slightly  over  one 
micron  —  on  the  mask.  At  the  3  micron  technology,  it  is 
common  to  use  an  1/2  micron  beam  size  —  thus  providing  a 
drawing  unit  of  1/2  micron  since  the  mask  pattern  is 
commonly  lx  compared  with  the  wafer  pattern.  At  the  1.2 
micron  technology,  wafer  steppers  are  commonly  used  with  two 
different  lens:  5x  and  lOx  reduction.  With  the  5x  lens  and 
lOx  lens,  the  maximum  beam  size  of  1  micron  corresponds  to 
drawing  units  on  the  wafer  of  0.2  micron  and  0.1  micron, 
r espec t i ve I y .  Considering  the  wider  angle  of  view  available 
at  5x  and  the  lower  cost  of  the  mask  per  design,  the  5x  is 
preferred  if  the  image  resolution  on  the  wafer  is 
satisfactory. 

A  careful  researcher  observes  that  the  desire  to 
provide  flexibility  to  accommodate  scaling  leads  to  small 
drawing  units  which  is  in  conflict  with  the  desire  to  have  a 
course  grid  for  designer  convenience.  The  solution  appears 
to  be  the  creation  of  two  grids,  a  coarse  grid  for  locating 
features  and  a  finer  grid  for  defining  the  edges  of  the 
features.  This  is  the  essential  ingredient  of  the  GRAIL 
system  at  MSU.  The  coarse  grid  provides  simplicity  for 
feature  location;  the  fine  grid  provides  the  feature  sizing 
required  for  adjusting  the  scaling  for  the  particular 
technology  or  vendor.  A  necessary  condition  is  that  the 
fine  grid  (drawing  units)  must  fall  on  the  coarse  grid 
< 1 ambda) . 

The  NMOS  lambda  rules,  illustrated  by  Mead  and  Conway, 
underscore  the  simplicity  of  designing  with  the  coarse  grid. 
Applying  this  concept  to  the  common  design  rules  for  the  3 
micron  technology  leads  to  a  lambda  of  1.5  microns. 
However,  one  observes  that  process  engineers  have  generally 
required  greater  feature  linewidths  and  spaces  for  the 
second  metal  interconnect  and  v  i  as  than  for  the  underlying 
structures.  As  a  result,  one  is  forced  either  to  represent 
the  upper  interconnect  structure  with  more  grid  units  than 
for  the  substructure  or  to  represent  the  interconnect 
structures  similarly  with  a  compromised  grid  size  and 
provide  suitable  adjustment  in  feature  sizes.  There  is 
little  question  in  the  author  s  mind  that  the  latter 
approach  is  far  easier  on  the  designer  for  comparable  layout 
densities.  For  that  reason  the  author  is  convinced  that  the 
lambda  grid  size  should  be  based  on  the  interconnection 
r epresen tat i on  concerns  and  not  on  simply  twice  the  minimum 
gate  length  as  utilized  by  Mead  and  Conway.  The  gate  length 
is  1.2  microns  in  the  1.2  micron  technology,  and  this 
argument  suggest  that  the  lambda  grid  should  be  something 
greater  than  the  0.6  microns.  Considering  the  drawing  unit 
to  be  0.2  microns  for  this  1.2  micron  technology,  the  next 
increment  for  the  lambda  grid  becomes  0.8  microns. 


CMOS 
SCALABLE 
DESIGN  RULES 
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RULES 

UNOER  CONSIDERATION  BUT 

NOT  RELEASED 

XSA 

4 

1 .225 

1  .925 

2.75 

2 

3.2 

2.67 

GSDs 

3 

1.225 

1  .4 

2 

2 

2.4 

2.00 

OSXb 

3 

-.175 

.2625 

.375 

-.4 

.4 

.33 

PSG  & 

POG  2 

1 .4875 

1  .575 

2.25 

2.6 

2.8 

2.33 

FU 

2 

2.1 

1  .575 

2.25 

3.6 

2.7 

2.25 

AO V 

1 

.9625 

.4375 

.625 

1  .8 

.9 

.75 

•JSC 

1 

.7875 

.525 

.75 

1 .4 

.9 

.75 

SSe 

3 

1  .4 

2.1 

3 

2.4 

3.6 

3.00 

DOCm* 

-.5 

.7 

.525 

.75 

1  .4 

1 

.83 

FOCm* 

.5 

.7875 

.4375 

.625 

1  .4 

.75 

.62 

FSe 

2.5 

1  .05 

1  .575 

2.25 

1  .8 

2.7 

2.25 

VSA 

1  .5 

.7875 

.9625 

1.375 

1  .2 

1  .5 

1  .25 

VSG 

1 .5 

1.1375 

1  .05 

1  .5 

2 

1  .9 

1  .58 

GOV 

1  .5 

.9625 

.7 

1 

1  .6 

1.1 

.92 

V0C2 

.5 

.4375 

.525 

.75 

.8 

.9 

.75 

SU 

2 

2.1 

1  .4 

2 

3.6 

2.4 

2.00 

NOTE: 

>The  "e" 

suffix 

indicates  the 

case  where 

two 

interconnect  lines  approach  each 

other 

at 

their 

"ends" , 

the  ends 

being 

no  greater  than 

4  1 ambda 

across . 

>The  Hs"  suffix  indicates  a  short  4  lambda  -feature 
which  has  this  minimum  space  to  another  -feature. 

>The  “m*“  su-f-fix  applies  in  a  direction  not  in  the 
direction  o f  the  metal  interconnect  line. 

>The  V0C2  applies  in  the  case  o-f  a  via  stacked  over 
a  contact  with  the  requirement  that  at  least  two 
edges  o-f  the  via  overlap  the  contact  by  V0C2. 

>The  ”b"  su-f-fix  indicates  the  special  case  -for  a 
modt-fied  butting  contact  structure  between  dogbone 
poly  and  active  with  two  contact  cuts  and  jumpered 
with  first  metal. 
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p 

BLOAT 

0 

0 

align  >A 

0 

0 

al  i gn  >A 

PSG 

&  POG 

3 

2.1875 

2.275 

3.25 

3.8 

4 

3.33 

POA 

2 

1.1375 

1 .4875 

2.125 

1  .8 

2.4 

2.00 

POX 

3 

.7875 

1.1375 

1  .625 

1  .4 

2 

1  .67 

PSA 

2 

1.1375 

1 .4875 

2.125 

1  .8 

2.4 

2.00 

PSX 

3 

.7875 

1.1375 

1  .625 

1  .4 

2 

1 .67 

POCw&PSCw 

1 

.7875 

.7 

1 

1  .4 

1  .2 

1  .00 

C 

BLOAT 

-.175 

.175 

al  i gn  >A 

-.4 

.4 

align  >G 

CW 

2 

1 .225 

1 .4 

2 

2 

2.4 

2.00 

CS 

2 

1  .575 

1  .4 

2 

2.8 

2.4 

2.00 

CSG 

2 

1.575 

1.575 

2.25 

2.8 

2.8 

2.33 

DOC 

0 

1  .05 

.875 

1.25 

2 

1  .6 

1  .33 

XOC, 

,  XOCw 

0 

1 .4 

.9625 

1  .375 

2.4 

1  .6 

1 .33 

F 

BLOAT 

.7 

-.525 

align  >C 

1  .2 

-.9 

al i gn  >C 

FU 

3 

2.8 

2.275 

3.25 

4.8 

3.9 

3.25 

FS 

3 

1  .4 

1  .925 

2.75 

2.4 

3.3 

2.75 

Fp 

6 

4.2 

4.2 

6 

7.2 

7.2 

6.00 

FOC 

1 

1.1375 

.7875 

1.125 

2 

1  .35 

1.13 

V 

BLOAT 

0 

.35 

al i gn  >F 

0 

.6 

al i gn  >F 

VU 

2 

1  .4 

1  .75 

2.5 

2.4 

3 

2.50 

VS 

2 

1  .4 

1.05 

1  .5 

2.4 

1  .8 

1  .50 

FOV 

1 

1  .05 

.6125 

.875 

1  .8 

1  .05 

.87 

VSC 

2 

1 .4875 

1  .225 

1  .75 

2.6 

2.1 

1.75 

GOV 

2 

1 .3125 

1  .05 

1  .5 

2.2 

1  .7 

1.42 

VSG 

2 

1.4875 

1  .4 

2 

2.6 

2.5 

2.08 

DOV 

1 

1 .6625 

1  .4 

2 

3 

2.5 

2.08 

VSD 

3 

1.1375 

1  .05 

1 .5 

1  .8 

1  .7 

1  .42 

VSA 

2 

1.1375 

1 .3125 

1  .875 

1  .8 

2.1 

1  .75 

AOV 

2 

1 .6625 

1.1375 

1  .625 

3 

2.1 

1 .75 

VSX 

3 

.7875 

.9625 

1  .375 

1  .4 

1.7 

1  .42 

XOV 

0 

1  .3125 

.7875 

1.125 

2.2 

1  .3 

1.08 

S 

BLOAT 

.7 

-.7 

align  >V 

1  .2 

-1.2 

al  i  gn  >V 

sw 

3 

2.8 

2.1 

3 

4.8 

3.6 

3.00 

ss 

4 

2.1 

2.8 

4 

3.6 

4.8 

4.00 

Sp 

7 

4.9 

4.9 

7 

8.4 

8.4 

7.00 

sov 

1 

1  .05 

.525 

.75 

1  .8 

.9 

.75 

*1 


«*./  •'V 
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2.6  DESIGN  RULE  CALCULATIONS  revision  1.2.5 

FOR  CMOS  DESIGN  RULES 
AT  LAMBDA  *  .7  and  1.2  MICRONS 

VARIABLE  LAMBDA  ■ 1 .2  mi  cron"  TECH  *2.4  micron"  TECH 


LABEL 

RULES 

1  LAMBDA® 

.7 

m  i  crons  1 

1  LAMBDA- 

1  .2 

micronsl 

!LAMBDA= 

4 

spots  I 

1  LAMBDA- 

3 

spots  1 

as 

mask  dim 

4 i n .dim 

4  i  n  .dim 

mask  dim 

4  in .dim 

4  in .dim 

drawn 

microns 

microns 

1 ambda 

microns 

microns 

1 ambda 

U 

BLOAT 

0 

0 

-4.8 

4.8 

UU 

6 

4.2 

4.2 

6 

2.4 

7.2 

6.00 

US 

6 

4.2 

4.2 

6 

12 

7.2 

6.00 

A 

BLOAT 

.525 

-.7 

al  i  gn  >U 

1  .2 

-1 .2 

align  >U 

AU 

2 

1 .925 

1  .225 

1  .75 

3.6 

2.4 

2.00 

AS 

3 

1.575 

2.275 

3.25 

2.4 

3.6 

3.00 

Ap 

5 

3.5 

3.5 

5 

6 

6 

5.00 

ApIUn 

6 

3.9375 

4.2875 

6.125 

9 

7.2 

6.00 

AnIUp 

6 

3.9375 

4.2875 

6.125 

4.2 

7.2 

6.00 

ApOUp 

4 

2.5375 

2.8875 

4.125 

1  .8 

4.8 

4.00 

AnOUn 

4 

2.5375 

2.8875 

4.125 

6.6 

4.8 

4.00 

X 

BLOAT 

2.625 

-.7 

align  >U 

4.4 

-1  .2 

align  >U 

XS 

6 

1.575 

2.275 

3.25 

2.8 

4 

3.33 

XpIUn 

7 

3.5875 

3.9375 

5.625 

8.6 

6.8 

5.67 

XnlUp 

7 

3.5875 

3.9375 

5.625 

3.8 

6.8 

5.67 

XpOUp 

3 

3.4125 

3.0625 

4.375 

3.4 

5.2 

4.33 

XnOUn 

3 

3.4125 

3.0625 

4.375 

8.2 

5.2 

4.33 

XSA 

5 

1.925 

2.625 

3.75 

3.2 

4.4 

3.67 

G 

BLOAT 

-.175 

-.175 

align  >A 

-.4 

-.4 

align  >A 

GW 

2 

1  .225 

1  .05 

1 .5 

2 

1  .6 

1.33 

GS 

2 

1  .575 

1  .75 

2.5 

2.8 

3.2 

2.67 

Gp 

4 

2.8 

2.8 

4 

4.8 

4.8 

4.00 

GSD 

4 

1 .925 

2.1 

3 

3.2 

3.6 

3.00 

GOA 

2 

1.05 

1  .3125 

1  .875 

1  .6 

2 

1 .67 

AOG 

2 

1 .75 

1  .4875 

2.125 

3.2 

2.8 

2.33 

GSA 

1 

.525 

.9625 

1.375 

.8 

1  .6 

1  .33 

GSX 

3 

.875 

1 .3125 

1.875 

1  .6 

2.4 

2.00 

GSXs 

2 

.175 

.6125 

.875 

.4 

1  .2 

1  .00 

D 

BLOAT 

1  .925 

-.175 

align  >A 

3.6 

-.4 

al i gn  >A 

DS 

5 

1 .575 

1  .75 

2.5 

2.4 

2.8 

2.33 

DSA 

3 

.875 

1  .3125 

1 .875 

1 .2 

2 

1  .67 

DSAs 

2 

.175 

.6125 

.875 

.0000000 

.8 

.67 

DSX 

4 

.525 

.9625 

1  .375 

.8 

1  .6 

1 .33 

RULES  UNDER  CONSIDERATION  BUT  NOT  RELEASED 


XSA 

4 

1 .4 

2.2 

2.75 

2.5 

4 

2.67 

OSDs 

3 

1 .4 

1 .6 

2 

2.5 

3 

2.00 

DSXb 

3 

-.2 

.3 

.375 

-.25 

.75 

.50 

PSG  & 

POG  2 

1.7 

1.8 

2.25 

3 

3.25 

2.17 

FU 

2 

2.4 

1.8 

2.25 

4.5 

3.375 

2.25 

AO V 

1 

1.1 

.5 

.625 

2.25 

1.125 

.75 

VSC 

1 

.9 

.6 

.75 

1 .5 

.875 

.58 

SSe 

3 

1 .6 

2.4 

3 

3 

4.5 

3.00 

DOCm* 

-.5 

.8 

.6 

.75 

1  .25 

.75 

.50 

FOCm* 

.5 

.9 

.5 

.625 

1  .5 

.6875 

.46 

FSe 

2.5 

1  .2 

1  .8 

2.25 

2.25 

3.375 

2.25 

USA 

1  .5 

.9 

1.1 

1  .375 

1  .5 

1  .875 

1  .25 

VSG 

1 .5 

1  .3 

1.2 

1.5 

2.25 

2.125 

1 .42 

GOV 

1 .5 

1.1 

.8 

1 

2.25 

1  .625 

1 .08 

V0C2 

.5 

.5 

.6 

.75 

.75 

.875 

.58 

SW 

2 

2.4 

1.6 

2 

4.5 

3 

2.00 

NOTE: 

>The  " e  “ 

suf f  i  x 

indicates  the 

case 

where 

two 

interconnect  lines  approach  each 

other 

at 

their 

"ends"  , 

the  ends 

being  no 

greater 

than  4  lambda 

across . 

>The  "s"  suffix  indicates  a  short  4  lambda  -feature 
which  has  this  minimum  space  to  another 

■feature  . 

>The  “m*"  su-f-fix  applies  in  a  direction  not  in  the 
direction  of  the  metal  interconnect  line. 

>The  V0C2  applies  in  the  case  of  a  via  stacked  over 
a  contact  with  the  requirement  that  at  least  two 
edges  of  the  uia  overlap  the  contact  by  V0C2. 

>The  "b"  suffix  indicates  the  special  case  for  a 
modified  butting  contact  structure  between  dogbone 
poly  and  active  with  two  contact  cuts  and  jumpered 
with  f i rst  metal . 
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revision  1.2.5 


DESIGN  RULE  CALCULATIONS 
FOR  CMOS  DESIGN  RULES 
AT  LAMBDA  =  .8  and  1.5  MICRONS 


VARIABLE 

LAMBDA 

1 

'1.2 

micron"  TECH 

*  3  micron*  TECH 

LABEL 

RULES 

1  LAMBDA* 

.8 

micronsl 

1  LAMBDA9 

1 .5 

micronsl 

1  LAMBDA* 

4 

spots  1 

1  LAMBDA® 

3 

spots  1 

as 

mask  dim 

•fin 

.dim 

fin. dim 

mask  dim 

f i n .dim 

f i n .dim 

drawn 

microns 

microns 

1 ambda 

microns 

microns 

1 ambda 

U 

BLOAT 

0 

0 

-6 

6 

uu 

6 

4.8 

4.8 

6 

3 

9 

6.00 

US 

6 

4.8 

4.8 

6 

15 

9 

6.00 

A 

BLOAT 

.6 

-.8 

align  >U 

1 .5 

-1  .5 

align  >U 

AU 

2 

2.2 

1.4 

1 .75 

4.5 

3 

2.00 

AS 

3 

1 .8 

2.6 

3.25 

3 

4.5 

3.00 

Ap 

5 

4 

4 

5 

7.5 

7.5 

5.00 

ApIUn 

6 

4.5 

4.9 

6.125 

11.25 

9 

6.00 

AnIUp 

6 

4.5 

4.9 

6.125 

5.25 

9 

6.00 

ApOUp 

4 

2.9 

3.3 

4.125 

2.25 

6 

4.00 

AnOUn 

4 

2.9 

3.3 

4.125 

8.25 

6 

4.00 

X 

BLOAT 

3 

-.8 

al  i  gn  >U 

5.5 

-1  .5 

align  >U 

XS 

6 

1.8 

2.6 

3.25 

3.5 

5 

3.33 

XpIUn 

7 

4.1 

4.5 

5.625 

10.75 

8.5 

5.67 

XnlUp 

7 

4.1 

4.5 

5.625 

4.75 

8 

.5 

5.67 

XpOUp 

3 

3.9 

3.5 

4.375 

4.25 

6.5 

4.33 

XnOUn 

3 

3.9 

3.5 

4.375 

10.25 

6.5 

4.33 

XSA 

5 

2.2 

3 

3.75 

4 

5.5 

3.67 

G 

BLOAT 

-.2 

-.2 

align  >A 

0 

-.5 

align  >A 

GU 

2 

1.4 

1.2 

1 .5 

3 

2.5 

1  .67 

GS 

2 

1 .8 

2 

2.5 

3 

3.5 

2.33 

Gp 

4 

3.2 

3.2 

4 

6 

6 

4.00 

GSD 

4 

2.2 

2.4 

3 

4 

4.5 

3.00 

GOA 

2 

1  .2 

1  .5 

1.875 

2.25 

2.75 

1.83 

AOG 

2 

2 

I  .7 

2.125 

3.75 

3.25 

2.17 

GSA 

1 

.6 

1.1 

1  .375 

.75 

1  .75 

1.17 

GSX 

3 

1 

1  .5 

1  .875 

1.75 

2.75 

1  .83 

GSXs 

2 

.2 

.7 

.875 

.25 

1  .25 

.83 

D 

BLOAT 

2.2 

-.2 

align  )A 

4 

-.5 

align  >A 

DS 

5 

1  .8 

2 

2.5 

3.5 

4 

2.67 

bloat 


METAL2 

S 

MIN. WIDTH 

sw 

MIN. SPACE 

ss 

MIN. PITCH 

Sp 

VIA  OVERLAP 

sov 

3 

4 
7 
1 


1 

4 

3 

7 


1.5 


-1  al i gn  >V 

3 

4 
7 


.75  .7 


NOTE:  >ALL  EDGES  of  features  must  be  defined  on  the  lambda 

grid. 

>If  feature  edges  are  separated  by  an  odd  number  of 
half  grids  within  a  layer,  the  edges  will  "flop"  in 
a  random  direction  by  1/2  spot  due  to  the  round  off 
in  the  pattern  generation. 

>There  are  two  additional  layers  referenced  in  the 
design  rule  calculations  on  the  following  pages, 
both  of  which  are  transparent  to  the  designer.  The 
X  1  ayer  is  generated  from  a  bloat  of  the  active 
contact  Ca  and  represents  the  active  dogbone  around 
the  contact.  This  X  layer  is  merged  with  the 
designers  A  1  ayer ,  after  its  bloat,  to  form  the 
real  ACTIVE  mask  1 ayer .  The  D  layer  is  generated 
from  a  bloat  of  the  poly  contact  Cp  and  represents 
the  poly  dogbone  around  the  contact.  This  D  1 ayer 
is  merged  with  the  designers  G  layer,  after  its 
bloat,  to  form  the  real  P0LY1  mask  layer.  This 
procedure  in  the  mask  generation  process 
facilitates  an  optimizing  of  the  spacing  between 
features  for  a  particular  technology  and  does  not 
ordinarily  concern  the  designer.  The  sizing  of 
these  features  are  calculated  on  the  following 
pages . 

J.  Donald  Trotter,  Mississippi  State  University, 
<601)325-3751 


NOTE:  >C  =  Ca  +  Cp  The  Ca  and  Cp  "design"  layer*  are 

the  contacts  to  Active  and  Polyl,  respectively.  The  Ca  and  Cp 
"design"  layers  merge  on  the  mask  to  form  the  Contact  mask 
layer.  "C"  without  a  subscript  refers  to  both  "design"  layers. 


CONTACT  (ACTIVE)  Ca  bloat 

-.25 

.25  al i gn 

>A 

WIDTH  CaW 

2 

1 .75 

2 

MIN. SPACE  CaS 

2 

2.25 

2 

ACTIVE  TO  A. CONTACT  ASCa 

5 

4.75 

5.125 

A.CONT  TO  POLY  GATE  CaSG 

2 

2.25 

2.25 

1 

ACTIVE  OVERLAP  AOCa 

1 

2 

1  .375 

.7 

FIELD  POLY  TO  A.CONT.  GfSCa 

3 

3.25 

3.25 

1 

above  for  <  5  lambda  GfSCas 

2 

2.25 

2.25 

1 

NOTE:  >GfSCas  and  CgSAs  allow  one 

contact  only. 

CONTACT  ( POLY 1 )  Cp  bloat 

-.25 

.25  al i gn 

>A 

WIDTH  CpW 

2 

1.75 

2 

MIN. SPACE  CpS 

2 

2.25 

2 

POLY  CONT  TO  ACTIVE  CpSA 

3 

2.75 

3.125 

.7 

above  for  <  5  lambda  CpSAs 

2 

1 .75 

2.125 

.7 

P .CONT  TO  A.CONT  CpSCa 

4 

4.25 

4 

POLY1  OVERLAP 

GOCp 

1 

1  .5 

1  .25 

1 

SPACE  TO  P. CONTACT  GSCp 

4 

4.25 

4.25 

1 

NOTE:  >Both  N-Well  and 

P-Well  contacts 

are  required. 

>The  shorting  contact  is 

not  allowed. 

>0nly  the 

2x2  (lambda)  contact  is  allowed. 

METAL 1 

F 

bl  oat 

1 

-.75 

al 

i  gn  >C 

MIN. WIDTH 

FW 

3 

4 

3.25 

MIN. SPACE 

FS 

3 

2 

2.75 

MIN. PITCH 

Fp 

6 

6 

6 

CONTACT  OVERLAP 

FOC 

1 

1 .625 

1.125 

.7 

VIA 

V 

bl  oat 

0 

.5 

al 

i  gn  >  F 

VIA  WIDTH 

VW 

2 

2 

2.5 

VIA  SPACE 

VS 

2 

2 

1  .5 

1st  METAL  OVERLAP 

FOV 

1 

1  .5 

.875 

.7 

VIA  SPACE  TO  CONT. 

VSC 

2 

2.125 

1  .75 

1 

GATE  OVERLAP  (VIA) 

GOV 

2 

1  .875 

1  .5 

1  .4 

SPACE  TO  POLY  GATE 

VSG 

2 

2.125 

2 

1  .4 

VIA  TO  ACTIVE 

VSA 

2 

1  .625 

1.875 

1  .2 

ACTIVE  OVERLAP(VIA)  AOV 

2 

2.375 

1  .625 

1  .2 

NOTE:  >Vias  over  active  area  are  permitted. 

>0n1y  a  via  size  of  <2x2)  is  allowed. 

>The  stacked  contact  (a  via  stacked  over  a  contact)  is  not 
al lowed. 


2.4  CMOS  Design  Rules  for  Lambda-Based  Designs 


These  lambda-based  design  rules  facilitate  designing 
for  the  1.2  micron  technology  and  processing  at  either  1.2 
or  3  microns.  The  lambda  dimensions  are  for  convenience  and 
do  not  correspond  to  mask  or  finished  dimensions. 

Bloat  for  each  1 ayer  refers  to  the  expansion  in  total 
linewidth,  going  from  the  drawn  dimension  to  the  mask 
dimension,  and  then  to  the  finished  dimension  on  the 
finished  wafer.  As  an  example,  the  minimum  active  line 
width  is  drawn  as  2  lambda.  It  is  bloated  to  2.75  lambda  at 
the  mask  and  then  shrinks  1  lambda  in  wafer  processing  to  a 
finished  wafer  linewidth  of  1.75  lambda. 


DESCRIPTION 

VARIABLE 

LAMBDA  RULES 

Assumed 

LABEL 

Drawn 

Mask 

Fi  n i shed 

Mi  sal i 

i  gnment 

WELL 

W 

bl  oat 

0 

0 

WELL  WIDTH 

WW 

6 

6 

6 

WELL  SPACING 

WS 

6 

6 

6 

ACTIVE  TRANSISTOR 

A 

bl  oat 

.75 

-1 

al  i  gn 

>W 

MIN. WIDTH 

AW 

2 

2.75 

1  .75 

MIN. SPACE 

AS 

3 

2.25 

3.25 

MIN. PITCH 

Ap 

5 

5 

5 

P+ INSIDE  N  WELL 

ApIWn 

6 

5.625 

6.125 

.7 

N+ INSIDE  P  WELL 

AnIWp 

6 

5.625 

6.125 

.7 

P+  Overlap  P  WELL 

ApOWp 

4 

3.625 

4.125 

.7 

N+  Overlap  N  WELL 

AnOWn 

4 

3.625 

4.125 

.7 

GATE 

G 

bl  oat 

-.25 

-.25 

align 

>A 

MIN. WIDTH 

GW 

2 

1  .75 

1.5 

MIN. SPACE 

GS 

2 

2.25 

2.5 

MIN. PITCH 

Gp 

4 

4 

4 

GATE  OVERLAP 

GOA 

2 

1  .5 

1  .875 

.7 

ACTIVE  0.  of  GATE 

AOG 

2 

2.5 

2.125 

.7 

POLY  TO  ACTIVE 

GSA 

1 

.75 

1  .375 

.7 

P+ 

P 

bl  oat 

0 

0 

align 

>A 

SPACE  TO  Op. Gate 

PSG  4  POG 

3 

3.125 

3.25 

1 

OVERLAP  OF  ACTIVE 

POA 

2 

1  .625 

2.125 

.7 

SPACE  TO  Op .Act i ve 

PSA 

2 

1  .625 

2.125 

.7 

WELL  CONT. OVERLAP 

POCw&PSCw 

1 

1.125 

1 

1 

reproduction  purposes.  These  calculations  Mere  performed 
using  a  spreadsheet  to  facilitate  the  what-if  nature  of  the 
exercise.  Emphasis  is  on  the  1.2  microns  technology  in 
conjunction  with  the  3  microns;  however,  two  additional 
levels  of  scaling  are  purposed  for  consideration  by  MOSIS:  a 
more  aggressive  1.2  microns,  corresponding  to  VHSIC 
dimensions  (roughly),  and  2  microns  <or  2.4  microns, 
depending  on  your  definition),  corresponding  to  the  believed 
present  lower  limit  for  full  wafer  exposure. 

The  rules  are  intended  to  support  N-well,  P-we 1 1 ,  and 
twin-tub  processing,  although  the  exact  interface  to  MOSIS 
for  the  N-well  is  still  to  be  determined  in  the  next  few 
months.  A  design  to  accommodate  all  varieties  must  have 
appropriate  well  (substrate)  contacts  for  both  polarities. 
A  well  around  the  die  with  the  same  polarity  as  the 
substrate  is  required  to  prevent  junction  leakage  at  the 
edge  of  the  chip.  It  is  still  to  be  determined  if  MOSIS  or 
the  designer  is  expected  to  provide  this  feature.  The 
nominal  drawn  P-we 1 1  is  drawn  equal  distance  between  the 
NMOS  and  PMOS  transistors.  Appropriate  bloating  and/or 
shrinking  is  to  be  provided  by  MOSIS. 


base  can  control  the  flop  to  always  be  in  a  consistent 
direction  by  disallowing  the  exact  1/2  spot  location. 

Consider  a  very  important  example.  In  the  Mead  and 
Conway  rules,  the  active  layer  overlap  of  the  contact  mask 
is  drawn  as  one  lambda.  However,  as  the  technologies  scale 
down,  the  alignment  errors  of  the  alignment  jigs  become  a 
more  significant  portion  of  this  vverlap  tolerance.  For 
fear  of  yield  losses  due  to  the  metal  shorting  to  the 
substrate  by  virtue  of  the  metal  contacting  the  substrate, 
some  vendors  have  increased  the  requirement  for  the  active 
overlap  of  the  contact  on  a  relative  basis  as  the  technology 
is  scaled  down.  How  should  the  designer  draw  the  overlap? 
1.5  lambda  maybe,  since  that  corresponds  to  an  acceptable 
increase!  That  half  lambda  edge  will  not  result  in  an 

uncontrolled  flop  of  the  feature  at  the  1.2  microns  since 
there  are  two  drawing  units  in  the  half  lambda.  However,  at 
the  3  micron  technology,  the  edge  falls  in  the  middle  of  the 
spot.  It  can  be  controlled  to  always  flop  in  the  positive 
direction  or  vice  versa,  however,  what  is  needed  is  for  the 
feature  edge  to  move  relative  to  the  position  of  the 
contact.  How  is  this  to  be  achieved? 

Bloating  of  the  overlap  structures  around  the  contacts 
is  a  key  requirement  in  providing  the  flexibility  for 
scaling.  Should  MOSIS  bloat,  as  an  example,  the  entire 
active  region  or  just  the  active  around  the  contact?  How 
should  MOSIS  handle  the  poly  1 ayer  since  the  gate  length 
needs  to  be  independently  controlled  versus  the  poly  overlap 
of  the  contact  (poly  dogbone)? 

After  considerable  effort,  discussions  with  MOSIS 
programmers,  and  discussions  with  designers  in  the 

community,  the  collective  decision  was  made  to  draw  for 
convenience  with  an  active  overlap  and  poly  overlap  of  one 
lambda.  In  addition,  the  designer  is  asked  to  distinguish 

the  poly  contact  from  the  active  contact  in  order  for  MOSIS 

to  more  easily  individually  bloat  the  four  structures: 
active  and  transistors,  active  contacts,  poly  and  poly 

gates,  and  poly  contacts.  As  an  example,  in  order  for  MOSIS 

to  generate  the  active  mask  definition  for  the  pattern 
generator,  MOSIS  will  bloat  all  the  active  contacts  an 
appropriate  amount  and  merge  these  features  with  the 

separately  bloated  active  features.  The  poly  layer  is 
treated  similarly.  Although  MOSIS  could  extract  the 

distinction  of  the  contacts  from  logic  operations  on  the 
data  base,  the  extra  computing  cost  redirects  the  solution 
to  the  designer  maintaining  that  information  with  the  design 
subm i ss i on . 

With  the  philosophy  worked  out,  what  remains  is  the 
determination  of  the  MOSIS  bloats  and  shrinks  on  each  layer 
and  the  individual  design  rules.  The  next  two  sections 
contain  this  information  in  a  rather  compact  form  for 
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Without  proof,  the  following  observation  is  provided: 
Twice  lambda  minus  one  drawing  unit  should  equal  the  minimum 
resolution  of  the  lithography.  This  underscores  the  ability 
to  increase  (bloat)  the  size  of  certain  neighboring  features 
by  one  drawing  unit  per  edge  without  violating  the  minimum 
resolution  requirement. 

These  observations  with  lots  of  detailed  layouts  and 
playing  the  "what-if"  game  have  led  to  the  following 
selections  (in  microns): 


technol ogy 


1.2  microns 

3  m i crons 

1 ambda 

o 

■ 

00 

1  .5 

drawing  unit 

0.2 

0.5 

spots/1 ambda 

4 

3 

minimum  resolution  1.4  2.5 


I 


i 


I 


These  arguments  have  led  to  the  basis  for  the  feature  sizing 
desired  on  the  mask  and  which  support  rather  straight 
forward  extension  of  the  original  Mead  and  Conway 
representation  of  device  and  interconnect  features. 
However,  there  is  the  question:  based  on  the  provided  lambda 
design,  by  what  algorithms  are  the  features  sized  by  MOSIS  ? 

There  are  at  least  six  different  design  methodologies 
used  in  the  community  to  develop  and  define  the  mask  artwork 
information.  Center  defined  features  provide  the  most 
simple  extension  to  bloating  (or  shrinking)  the  features 
required  by  MOSIS;  however,  MAGIC  and  CAESAR  are  two  edge 
defined  systems  essential  to  the  community.  A  solution  is 
necessary  to  accommodate  these  systems. 

A  little  detail  that  is  important  in  the  MOSIS  system 
is  that  there  cannot  be  an  absolute  reference  dependency  in 
sizing  the  features  since  MOSIS  relocates  designs 
independently  of  the  lambda  grid.  What  happens  when  the 
edge  definition  of  a  feature  does  not  fall  on  the  edge  of 
the  raster  for  the  e-beam?  The  software  rounds  the  feature 
edge  off  to  the  nearest  raster  edge.  In  other  words,  the 
feature  "flops"  to  the  nearest  raster.  If  the  feature  size 
is  an  integer  number  of  spot  sizes  (drawing  units),  the 
pattern  will  flop  toward  the  smallest  distance  but  still 
maintain  the  feature  size  as  drawn.  If  the  feature  edge 
falls  directly  in  the  middle  of  a  spot,  the  flop  will  be  in 
a  random  direction  with  half  of  a  drawing  unit  displacement. 
Providing  a  very  small  bias  to  all  the  numbers  in  the  data 
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interconnections  and  transistors.  However,  clocked  CMOS, 
with  the  concept  of  precharging  circuit  nodes  and  then 
conditionally  discharging  them,  requires  only  one 
transistor  per  signal  input  C33.  PLA  design  with  clocked 
CMOS  has  been  developed  with  this  distinct  advantage  in 
mi  nd. 


The  Path  Programmable  Logic  <PPL>  is  a  variation  of  a 
PLA  in  which  the  AND  array  and  the  OR  array  are  folded 
together  so  that  input  lines  and  output  lines  are 
alternated  within  a  single  array.  Column  and  row  breaks  in 
the  AND  and  OR  planes  allow  the  design  of  independent 
arrays  on  the  same  integrated  circuit.  This  concept;  which 
is  familiar  as  Storage/Logic  Array  (S' <0,  was  originally 
published  by  Suhas  Patil  C13.  This  project  also  explores 
the  possibility  of  implementing  a  PPL  with  CMOS  and 
compares  it  with  other  PLA  implementations. 


The  process  used  for  fabrication  is  a  silicon-gate 
P-well  process  with  double  1 ayer  metal.  The  lambda  based 
scalable  design  rules  facilitate  designing  for  1.2  micron 
and  then  processing  at  either  1 .2  or  3  microns.  The  legend 
of  the  layers  used  and  the  documentation  of  the  design 
rules  are  included  in  Appendix  B. 


INPUT  LATCHES 


CUTPUT  LATCHES 


Figure  1.1  General  PLA  Block  Diagram 


This  project  explores  PLA  design  using  CMOS 
technology.  Classic  CMOS  circuitry  requires  signals  to 
drive  pairs  o-f  transistors  instead  o-f  one,  as  is  normally 


used  in  NMOS . 


This  requires  greater 


area 


■f  or 
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CHAPTER  I 

INTRODUCTION 

The  advent  of  Mery  Large  Scale  Integrated  (VLSI) 
circuit  technology  illustrates  the  continuing  trend  toward 
increased  gate  counts  on  logic  chips.  Because  o-f  the 
complexity  and  cost  o-f  designing  such  chips,  a  structured 
form  o-f  logic  implementation  is  desirable  Cl].  A 
Programmable  Logic  Array  <PLA>  is  an  orderly  structure  for 
handling  combinational  sequential  logic  functions  C21. 
PLA's  have  became  increasingly  popular  as  a  means  of 
implementing  complex  logic  with  reduced  engineering  costs 
C33.  The  PLA  implementation  has  the  advantage  of  giving 
designers  more  flexibility  because  logic  changes  can  be 
made  easily  and  quickly.  PLA's  can  realize  arbi trary 
logical  functions  and  can  be  used  in  mi croprogrammi ng 
controllers  as  well  as  in  random  logic  circuits. 

The  PLA,  which  is  conceptually  a  two-level  AND-OR,  is 
attractive  in  LSI  because  of  its  memory-like  regular 
structure  [4].  In  contrast  with  ROM,  only  implicants  and 
not  minterms  of  boolean  functions  are  stored,  so  that  it  is 
not  necessary  to  have  a  word  of  storage  for  every  input 
combination.  The  overall  structure  of  a  PLA  is  illustrated 
in  Figure  1.1.  The  inputs  provided  with  temporary  storage 
during  phase  1  by  input  latches  are  run  through  the  AND 
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ABSTRACT 


Various  Programmable  Logic  Array  (PLA)  implementations 
with  clocked  CMOS  technology  art  explored  in  this  project. 
Three  different  CMOS  PLA  circuit  styles  are  described:  the 
■large"  PLA  uses  a  gated  OR  plane  and  is  useful  for  a 
system  with  large  number  of  inputs;  the  "moderate”  PLA  and 
the  "small"  PLA  are  ripple  varieties  with  the  former  having 
the  capability  of  handling  a  larger  number  of  inputs  than 
the  latter.  Path  Programmable  Logic  (PPL),  which  is  a 
folded  form  of  a  PLA,  is  also  studied.  A  symbolic  form  of 
representation  is  developed  and  future  PPL  development 
activities  are  discussed.  The  PPL  approach  has  a  size  and 
flexibility  advantage  over  the  other  PLA  approaches  - 
except  in  applications  requiring  large  PLA's. 


circuit  schematics  and  layouts.  Chapter  VI  discusses  in 
detail  the  advantages  and  disadvantages  o-f  all  the  PLA's 
designed,  relative  to  their  application,  circuit  area  and 
performance . 
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PREFACE 


This  project  explores  custom  CMOS  design  methodologies 
for  different  Programmable  Logic  Array  <PLA> 
implementations  often  used  in  the  circuitry  which  controls 
the  functional  blocks  of  Microcomputers. 


Chapter  I  reviews  the  basic  theory  of  PLA 
implementation  and  discusses  CMOS  designs  and  the 
fabrication  process.  Chapter  II  explains  clocked-CMOS 
circuit  concepts,  both  ripple  and  gated  versions.  Chapter 
III  describes  a  PLA  design  using  a  paral 1  el -transi stor 
configuration  suitable  for  a  large  number  of  inputs.  To 
program  this  PLA,  a  pascal  program  is  developed  and 
documented  in  Appendix  A.  Chapter  IV  describes  ripple  PLA 
designs  suitable  for  a  small  number  of  i nputs/impl i cants. 
The  ‘small*  PLA  version  described  can  accommodate  a  smaller 
number  of  impli cants,  but  it  is  faster  than  the  other  PLA's 
described.  The  "moderate*  size  PLA  is  designed  to 
accommodate  a  1  arger  number  of  inputs  at  the  cost  of 
additional  circuitry.  Chapter  V  discusses  Path 
Programmable  Logic  <PPL>;  a  PPL  cell  set  is  defined  and  a 
design  approach  using  symbolic  represen  tat  i  on  is  developed. 
In  Appendix  B  lambda-based  scalable  design  rules  used  for 
the  above  designs  are  documented,  and  exceptions  to  the 
design  rules  are  discussed.  The  various  cells  c*  all  PLA's 
discussed  are  provided  in  Appendix  C,  which  contains 
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2.1  Clocked  CMOS  Circuits 


Clocked  CMOS  circuitry  is  based  on  the  concept  of 
precharged  gates.  A  three  input  NOR  gate  can  be  implemented 
with  three  NMOS  transistors  in  parallel  and  a  single  PMOS 
transistor  as  load  or  precharge  device  (Figure  2.1).  The 
PMOS  transistor  precharges  the  output  node  unconditionally 
high  (logic  *1")  when  phase  1  goes  high.  The  logic  is 
evaluated  by  the  NWOS  transistors  which  conditionally 
discharge  the  output  node  to  ground.  It  should  be  noted 
that  the  precharge  device  should  be  OFF  when  the  logic  is 
being  evaluated.  Therefore  logic  is  evaluated  when  phasel' 
i s  h i gh .  As  the  NMOS  transistors  are  in  parallel,  any  one 
o-f  the  inputs  going  high  discharges  the  output  node,  hence 
the  NOR  -function.  I-f  the  evaluation  logic  does  not  offer  a 
path  to  ground,  the  output  node  remains  in  the  precharged 
state.  The  output  node  with  an  associated  leakage  time 
constant  is  dynamic  in  nature  and  should  be  sampled  with 
appropriate  timing  constraints.  This  type  of  circuit  will 
be  referred  to  as  N-gate.  The  NMOS  devices  arranged  in 
series  provide  the  NAND  function.  A  more  general  analysis 
of  the  circuit  can  be  done  in  a  similar  fashion. 

The  use  of  an  evaluation  device  (Figure  2.3)  in 


Clocked 
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series  with  the  logic  evaluation  block  allows  precharging 
without  requiring  all  of  the  NMOS  gates  to  be  turned  off. 
No  static  power  drain  is  encountered  other  than  -from 
leakage.  Applying  a  high  signal  to  the  evaluation  device 
enables  this  "gated*  device. 

The  dual  o-f  a  N-gate  is  a  P-gate.  The  logic  is 
evaluated  using  PMOS  transistors  and  the  precharge 
<pre-di scharge)  device  is  a  NMOS  transistor.  The  output  is 
uncondi t i onal 1 y  precharged  low  (logic  "0")  by  the  NMOS 
transistor  during  the  precharge  period  and  condi t i onal 1 y 
charged  high  by  the  PMOS  transistors  during  the  evaluation 
period.  If  the  evaluation  logic  fails  to  charge  the  output 
node,  it  remains  in  the  precharged  state.  As  in  classic 
CMOS,  the  ser i es-paral  1  el  combination  of  PMOS  transistors 
must  be  the  dual  of  the  paral 1  el -ser i es  combination  of  NMOS 
transistors  for  the  same  logic  function.  For  example,  NMOS 
transistors  in  parallel  provide  the  NOR  function,  whereas 
PMOS  transistors  in  series  form  the  NOR  function.  This  is 
illustrated  in  Figure  2.1. 

N-gate  can  drive  or  be  driven  by  a  P-gate.  This 
characteristic  comes  from  the  fact  that  the  evaluation 
devices  of  the  P-gate  should  be  OFF  during  the  precharge 
phase,  which  means  that  active  high  gate  inputs  are  needed. 
This  is  possible  if  these  gates  are  driven  with  N-gate 
outputs  which  are  precharged  high.  Therefore  a  string  of 


clocked  gates  of  alternating  types  can  be  precharged  during 
the  precharge  phase,  and  during  the  evaluation  phase  logic 
signals  propagate  in  a  ripple  through  (asynchronous) 
manner.  A  notable  advantage  o-f  this  concept  is  the  ease  o-f 
running  successive  stages.  Addition  of  an  evaluation 
device  to  the  gates  increases  l.ogic  flexibility  (for 
example,  N-gate  driving  an  N-gate),  although  the 
asynchronous  nature  of  the  circuitry  is  sacrificed. 


2.2  Types  of  CMOS  gates 

Clocked  CMOS  gates  can  be  divided  mainly  into  two 
families:  ripple  variety  without  the  evaluation  device;  and 
the  gated  variety  with  the  evaluation  device.  Figure  2.2 
and  Figure  2.3  illustrate  two  versions  of  each  variety. 
Dual  versions  work  on  similar  grounds.  It  should  be  noted 
that  similar  series-connected  gate  versions  also  exist. 

2.2.1  Ripple  circui  ts 

The  first  circuit  of  Figure  2.2  is  NRi  ,  which 
indicates  N-gate,  Ripple  and  inverting  output.  The  inputs 
are  precharged  low,  the  output  is  precharged  high,  and  the 
logic  evaluated  is  NOR.  The  second  ripple  variety  is  NRn , 
which  indicates  non-inverting  output.  The  inputs  and 


output  are  precharged  low,  and  the  logic  function  is  OR. 
Since  the  output  voltage  swing  is  limited  by  the  device's 
threshold  voltage,  this  version  is  in  general  not  useful, 


and  is  used  only  in  special  circumstances. 


2.2.2  Gated  c i rcu i ts 

The  difference  between  the  two  gated  versions  as 
illustrated  in  Figure  2.3  is  the  location  o-f  the  evaluation 
device.  Both  o-f  them  provide  the  NOR  logic  -function.  NGa 
version  indicates  that  its  inputs  must  be  driven  -from  an 
“active11  sourcing  output  to  prevent  charge  splitting.  That 
is,  the  outputs  driving  this  gates  must  be  able  to  supply 
current  -from  the  positive  supply  for  a  “1“  input  when  the 
evaluation  device  is  turned  on.  The  output  of  the  NGa  gate 
is  precharged  high  during  the  precharge  phase.  At  this 
time,  even  one  input  precharged  high  results  in  the  common 
sources  of  the  logic  devices  being  precharged  high  to 
within  one  threshold  below  the  supply  voltage.  When  the 
evaluation  device  is  turned  on,  the  source  node  is  pulled 
to  ground  with  a  coupling  through  the  full  gate  capacitance 
to  the  input  nodes.  If  there  is  no  active  source  for 
charging  this  coupling  capacitance,  then  the  input  signal 
level  is  pulled  toward  ground,  thereby  reducing  the 
effective  drive  of  the  input.  A  classic  inverter  or  a 
P-gate  can  provide  the  full  input  drive  voltage. 

The  second  gated  variety  is  NGc ,  which  indicates  that 
large  capacitance  load  is  required  when  the  fan-in  is 
large,  in  order  to  minimize  charge  splitting.  During 
precharge,  the  output  is  orecharged  high,  and  the  common 
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3.1  Parallel  gates  PLA 

Large  PLA's  require  1 arge  fan-in  (F/I).  For  large 
-fan-ins  the  speed  of  many  series  gates  may  be  disastrous. 
The  alternative  is  to  use  parallel  gates. 

The  four  basic  parallel  gate  combinations  possible  for 
implementing  the  OR  plane  in  ripple  through  fashion  are: 

1)  NOR  2)  OR  3)  NOT-NAND  4)  NOT -AND 


Figure  3  . 1  Possible  combinations  for  the  OR  plane 

In  Figure  3.1  NMOS  devices  are  the  logic  evaluating 
devices  for  NOR/OR  logic  and  similarly  PMOS  devices  are  the 
logic  evaluating  devices  for  NAND/AND  logic  for  parallel 
configuration.  Input  gates  for  NMOS  devices  should  be 
precharged  low  and  that  of  PMOS  devices  precharged  high 


One  way  o-f  implementing  the  PLA  using  parallel  gates 
with  OR  plane  gated  is  shown  in  Figure  3.3.  The  timing  -for 


this  PLA  is  illustrated  in  Figure  3.4. 


phase  1 


phase  2 


phase  Id 

Pracharga  Evaluate  Evaluate 
PLA  and  AND  OR 

latch  inputs 

Figure  3  .4  Timing  diagram  of  gated  PLA 

The  AND  plane  is  considered  to  be  o-f  N-gate  variety 
and  OR  plane  o-f  P-gate  variety.  A  single  master  clock  is 
considered  -for  this  dynamic  PLA.  During  phasel  the  inputs 
are  latched  and  all  the  input  nodes  and  output  nodes  of  the 
AND  array  and  OR  array  are  precharged  to  their  respective 
standby  states.  During  phase2  (phasel''),  the  AND  plane  is 
evaluated  in  a  ripple  through  -fashion.  The  clock  signal 
associated  with  the  OR  plane  is  phaseld.  In  accordance 


with  the  PGa  requ i remen ts ,  the  evaluation  transistor  should 
be  ON  only  a-fter  the  AND  plane  logic  is  available  at  the  OR 
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plane  inputs.  As  the  OR  plane  inputs  are  precharged  low. 
the  evaluation  o-f  OR  plane  before  AND  plane  evaluation 
leads  to  con-fusing  results.  Phaseld  provides  the  required 
delay  -for  the  OR  plane.  Then  the  OR  plane  logic  is 
evaluated  and  the  in-formation  is  available  in  the  output 
-f  1  i  p— f  1  ops. 

3.2  TYPICAL  CELLS 

3.2.1  Input  Ci rcu i t 


Figure  3  .5  Schematic  of  input  latch 

The  input  circuit  is  shown  in  Figure  3.5.  The 
switch  notation  with  *C"  re-fers  to  the  CMOS  transmission 


gate  which  utilizes  common-source  operation  for  both  the 
devices  to  provide  rail  to  rail  coupling.  The  operation  of 
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the  switch  can  be  better  understood  -from  the  two  equivalent 
circuits  shown  in  Figure  3.6.  The  circuit  o-f  Figure  3.6a 
applies  when  the  incoming  signal  is  high  and  that  o-f  Figure 
3.6b  applies  when  the  incoming  signal  is  low.  Therefore 
■for  a  CMOS  trans-fer  gate,  it  is  always  considered  that  the 


drai  n 

o-f  one  device 

i  s 

connected 

to  the  source  o-f 

the 

other ; 

this  enables 

•ful  1 

vol tage 

swing,  a  condition 

not 

possible  with  a  single  polarity  trans-fer  gate. 


Input  ' 1 ' 


Output 


L-^j^phas. 


;q  2 


Output  Input  '0' 

(a)  for  positive  input  Cb>  for  negative  input 
Figure  3  .6  CMOS  Transmission  gate 


The  rest  o-f  the  circuit  is  NGc  gate  and  works  on  the 
grounds  discussed  already.  On  the  -falling  edge  o-f  phase2, 
the  output  is  precharged  high,  and  it  is  conditionally 
discharged  low  on  the  rising  edge  o-f  phase2.  The  logic 


avail abl e  i s  A" . 


of  the  programmed  cells,  which  are  appended  to  the  standard 
PLA  layout.  Figure  3.14  shows  the  layout  with  these 
transistors  programmed  in.  The  diffusion  cells  which  are 
programmed  in,  called  as  Acells  and  Ocells  for  the  AND 
plane  and  OR  plane  respect i vel y ,  are  shown  in  Figure  3.15. 

3.2.3  Delay  Ci rcu i t 

The  delay  signal  phase  Id  is  generated  by  this 
circuit.  The  main  functions  of  the  delay  signal  can  be 
better  understood  from  the  timing  diagram  in  Figure  3.4. 
Signal  phaseld  is  used  by  the  OR  plane  and  output  circuit. 
It  should  precharge  both  these  circuits  when  the  rest  of 
the  PLA  is  in  the  precharge  phase.  So  phaseld  will  be  high 
following  the  clock  signal  phase  1 .  When  phasel  goes  low, 
it  implies  the  evaluation  phase.  But  phaseld  remains  high 
for  a  little  more  time  until  th  AND  plane  is  evaluated. 
Then  it  goes  low  for  OR  plane  evaluation  and  output 
latching.  Again  phasel  going  high  or  phase2  going  low 
should  pull  phaseld  high  for  precharge,  thus  repeating  the 
cycl e . 

The  circuit  used  to  generate  a  signal  satisfying  the 
above  requirements  is  shown  in  Figure  3.16.  All  the  NOR 
inputs  except  one  are  connected  to  ground  so  that  all  these 
transistors  are  OFF  at  all  times.  This  way  the  output  is 
delayed  without  affecting  the  logic.  Thus  a  worst-case 


At  every  location  in  the  AND  plane,  there  are 
three  possibilities:  a  transistor  which  is  turned  on  -for  a 
logic  "1“  input,  a  transistor  which  is  turned  on  -for  a 
logic  ”0*  input,  and  no  transistor  at  all.  These  are 
represented  as  1,  0,  and  x  respectively.  For  every 

location  in  the  OR  plane  there  are  two  possibilities: 
either  the  presence  or  the  absence  of  a  transistor.  They 
are  represented  as  1  and  0  respect  i  vel  y .  In  Figure  3.13 
there  are  no  transistors  in  the  AND  and  OR  planes.  It 
should  be  noted  that  in  the  AND  plane  o-f  Figure  3.13,  there 
are  only  two  inputs  and  their  complements.  For  example, 
the  AND  and  OR  planes  are  programmed  with  the  following 
logic: 


AND 

OR 

i nputs 

i  nputs 

0  0 

10  0  0 

0  1 

110  0 

1  0 

0  0  10 

1  1 

0  0  0  1 

The  PASCAL  program  of  Appendix  A  allows  programming  in 
diffusion,  thereby  creating  a  transistor  wherever 
necessary.  The  PASCAL  program  takes  the  above  information 
as  input  data  and  generates  MSLL  (Mississippi  State  Layout 
Language)  code  which  in  turn  gives  the  layout  description 
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both  precharged  low.  Since  the  inputs  are  precharged  low, 
and  the  logic  is  evaluated  by  PMOS  transistors,  the 
evaluation  transistor  which  is  also  a  PMOS  transistor 
should  not  turn  on  until  the  AND  plane  logic  is  available 
at  the  OR  plane  inputs.  This  signal  phaseld,  which 
controls  the  OR  plane  is  generated  by  the  delay  circuit. 
The  OR  plane  outputs  feed  the  output  latches.  Figure  3.12 
shows  the  schematic  of  the  AND-OR  plane.  The  layout  with 
four  inputs,  four  imp! i cants,  and  four  outputs  is  shown  in 
Figure  3.13. 


Figure  3  .  12  AND-OR  circuit 
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Active  areas  are  programmed  with  the  program  given  in 
Appendix  A,  so  that  the  circuit  provides  the  combinations 
mentioned  above.  Out  o-f  the  twelve  cells  termed  NOR  cells, 
■four  are  used  -for  each  input  circuit  cell  with  two  inputs. 
The  NOR  cells  named  N1  through  N12  are  shown  in  Figure  3.8. 
The  layout  o-f  the  circuit  be-fore  the  active  regions  are 
programmed  is  shown  in  Figure  3.9.  Figure  3.10  shows  the 
layout  with  input-partitioning,  and  Figure  3.11  without 
i n put -par t i t i on i ng. 

3.2.2  AND-QR 

The  heart  o-f  the  PLA  con-figuration  is  the  AND-OR. 
O-f  the  many  possible  combinations,  the  one  used  -for  this 
design  is  NOR-NOT-NAND .  It  should  be  noted  that  only 
parallel  transistor  con-figurations  are  used.  A  NOR  in  the 
first  phase  implies  that  it  should  be  driven  with  inverted 
inputs,  thus  providing  AND  logic  from  NOT -NOR.  The  NOR  is 
implemented  as  NRi ,  where  the  inputs  are  precharged  low  and 
the  outputs  are  precharged  high  during  the  precharge  phase. 
Logic  is  evaluated  in  r i pp 1 e-through  manner  during  the 
evaluation  phase,  conditionally  discharging  the  output 
node  . 


A  similar  combination  of  NOT-NAND  provides  the  OR 
logic.  A  P-type  gated  version  <PGa>  is  used  to  implement 
the  OR.  The  signal  used  for  the  precharge  and  evaluation 
devices  is  phaseld.  The  OR  plane  inputs  and  outputs  are 
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Two  input  signals  can  be  ANDed  before  entering  the  AND 
plane.  This  concept  termed  input-partitioning  is  used  as  a 
means  of  compacting  the  design.  The  input-partitioning  can 
be  done  with  NOR  gates  in  either  an  NGa  or  an  NGc  version; 
the  latter  is  used  for  this  design.  When 
input-partitioning  is  done  for  inputs  A  and  B,  then  AB, 
A'B',  A'B,  and  AB"  drive  the  AND  plane.  It  does  not  cost 
the  AND  plane  any  extra  inputs  because  every  input  also  has 
its  complement  driving  the  AND  plane;  for  two  inputs,  there 
are  four  input  lines  in  the  AND  plane.  The  programming 
capability  provides  the  option  of  having  either  the 
partitioned  inputs  or  the  original  inputs  at  the  output. 
The  PASCAL  program  of  Appendix  A  provides  the  option  of 
using  input-partitioning  for  the  design.  The  circuit  with 
input-partitioning  used  for  this  design  is  shown  in  Figure 
3.7.  This  Figure  shows  that  for  every  two  inputs  there  are 
four  outputs,  and  that  every  output  has  three  options. 
For  example,  at  the  partitioned  output  AB,  there  is  a 
possibility  of  obtaining  AB  or  A  or  B  by  programming. 
Theref ore ,  there  are  31  combinations  possible  altogether. 
For  simplicity  only  two  combinations  are  considered  for 
this  design. 


Ou  tou 1 1 


0utout2  0utDut3  0utout4 


delay  path  is  created.  It  should  be  noted  that,  of  all  the 
AND  plane  outputs  this  is  the  most  delayed  output.  This  is 
the  dummy  row  o-f  the  AND  plane;  it  is  placed  at  the  top  o-f 
the  plane. 

A-fter  passing  through  two  inverters,  phase2  is 
connected  to  the  only  input  o-f  NOR  which  controls  the  ouput 
node.  Uhen  phase2  is  low,  this  transistor  is  OFF  and  its 
output  is  precharged  high  with  the  PMOS  transistor.  When 
phase2  goes  high,  the  output  node  is  discharged  to  ground. 
This  node  a-fter  passing  through  two  inverting  buffers  is 
given  as  phaseld  signal  to  the  OR  plane  and  the  output 
latch.  Phaseld  should  go  high  when  phase  1  goes  high,  in 
order  to  -force  the  OR  plane  and  output  latches  to  precharge 
mode.  I-f  this  change  in  phaseld  is  expected  from  the  AND 
plane  output,  there  will  be  a  delay.  Therefore,  a  PMOS 
transistor  with  phase2  input  is  used  so  that  phaseld 
immediately  reacts  to  the  change  in  phasel .  The  layout  of 
this  cell  is  shown  in  Figure  3.17. 


3-2.4  Pmost  1  y  Latch 

The  dynamic  Pmostly  latch  is  used  to  latch  the  OR 


p  1  ane 

ou  tpi 

u  ts . 

Its  c  i  r>cu  i  t 

schematic  i 

n  Figure 

3.13 

shows 

that 

the 

i  npu 

t  must  be 

precharged 

I  ow . 

So 

during 

precharge , 

the 

logic  "0"  i 

npu  t 

turns 

the  hWOS 

tr  an 

s i stor 

OFF, 

bu  t 

the 

PMOS  trans 

i  stor 

wi  1  1 

be  ON. 

Th  i  s 
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transistor  is  in  series  with  another  PMOS  transistor  with 


phaseld  input.  During  precharge,  the  phaseld  signal  is 
high,  turning  off  the  PMQS  transistor.  Hence  during 
precharge,  there  is  no  path  to  the  output.  During 
evaluation,  if  the  input  is  conditionally  charged  high,  the 
NMOS  transistor  turns  on  and  a  valid  output  is  available. 
If  the  input  stays  low  during  evaluation,  the  PMOS 
transistor  is  ON  and  the  series  PMOS  transistor  with 
phaseld  input  is  also  ON  as  phaseld  is  low  during 
evaluation.  Thus  a  valid  output  is  available  in  either 


case  during  evaluation.  The  output  of  the  PLA  is  tapped  at 
the  inverting  buffer  output  because  the  previous  node  is  a 


-floating  node.  Since  the  additional  PMOS  transistor 
controls  the  latch,  this  latch  is  named  a  Pmostly  Latch. 
The  layout  o-f  the  latch  -for  two  inputs  and  two  outputs  is 
shown  in  Figure  3.19. 

3.2.5  SR-Latch 

The  schematic  o-f  the  Set-Reset  latch  is  shown  in 
Figure  3.20.  The  back  to  back  inverters  are  driven  by  NMOS 
transistors.  The  aspect  ratios  of  these  NMOS  transistors 
should  be  sufficiently  large  compared  to  the  internal  rMCS 
transistors  so  that  the  internal  latch  can  be  upset.  For 
the  present  design,  the  aspect  ratio  of  hWOS  transistors  is 
twice  as  large  as  that  of  PMOS  transistors.  The  set  and 
reset  transistor  inputs  come  from  the  OR  plane,  and  both  of 
them  are  turned  off  during  the  precharge  state.  For 
setting  or  resetting,  the  respective  input  should  go  high 
during  eval uat i on , ' enabl i ng  the  latch.  It  should  be  noted 
that  no  timing  signal  is  required  here,  but  two  OR  plane 
outputs  are  required  for  each  flip-flop.  Output  drive 
buffers  are  used  to  supply  adequate  source  current.  Figure 
3.21  illustrates  the  layout  for  the  set-reset  latch. 

All  the  cells  described  above  are  documented  in 
Appendix  C  with  their  circuit  schematics  and  layouts. 
These  cells  are  used  in  the  design  of  the  PLA  example  in 
the  following  section. 
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3.3.2  Seven  Segment  Display 

The  description  and  the  truth  table  -for  a 
seven-segment  display  is  shown  in  Table  3.2.  The 
seven-segment  display  needs  -four  inputs  and  seven  outputs 
•for  its  operation.  There-fore  inputs  12  through  15  and 
outputs  02  through  08  o-f  the  PLA  are  used. 


3.3.3  Test  Signals 

It  is  also  important  to  note  the  best  and  worst 
cases  o-f  the  PLA.  Test  signal  <a>  is  generated  by  applying 
a  logic  ■  1  ■  input  at  II  and  collecting  at  01  using  the 
bottom  implicant.  The  test  signal  (b)  is  generated  by 
feeding  logic  "1"  to  all  the  32  inputs  and  taking  the 
output  from  02S  by  using  the  top  implicant.  Test  signals 
(a)  and  <b)  correspond  to  the  best  and  worst  cases, 
respect i vel y .  With  these  test  signals,  the  propagation 
delays  in  the  AND  and  OR  planes  are  estimated  for  the  best 
and  worst  cases. 


3.3.4  Pr oor amm i no 

Input-partitioning  can  be  done  on  every  two  inputs, 
so  there  are  16  sets  which  can  be  partitioned.  For  the 
present  application,  input-partitioning  is  used  only  on  one 
set,  i.e.  on  inputs  13  and  14.  The  input  data  for  the 
PASCAL  program  to  code  this  information  is  given  below  in 


B 


Table  3.3. 
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Table  3.1  16  by  4  priority  encoder 
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Table  3.2  Seven  secment  display 
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o/pl 

o/p2 

o/p3 

o/p' 
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A 
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B' 

A 
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B 

B' 

A 

A 

B 

B' 

A 

Table  3.3  Data  for  input  partitioning 


The  input  and  output  truth  tables  for  the  PLA,  which 
include  the  examples  discussed  above,  are  shown  in  Table 
3.4  and  Table  3.5  respectively.  Tables  3.3,  3.4,  and  3.5 
are  given  as  input  data  to  the  PASCAL  program.  The  program 
gives  a  listing  of  all  cells  required  in  MSLL  format.  This 
is  then  appended  to  the  rest  of  the  MSLL  code  to  generate 
the  complete  layout.  Layouts  of  input  latches,  output 
latches,  AND  array,  and  OR  array  are  shown  in  Figures  3.22, 
3.23,  3.24,  3.25  respec t i ve 1 y . 


The  floor  plan  of  the  PLA,  including  the  I/O  buffers, 
is  shown  in  Figure  3.26.  The  pin  assignments  of  the  36-pad 
frame  are  shown  in  the  figure.  Nine  pads  are  multiplexed 
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-for  use  as  input  during  phasel  and  output  during  phase2. 
Eight  pads  are  left  unconnected.  Inputs  II  and  122  through 
132  are  connected  to  the  power  supply  and  outputs  013 
through  027  are  not  connected  since  they  are  not  used.  The 
interconnections  to  the  pads  are  illustrated  in  Figure 
3.26.  The  final  PLA  chip  layout  is  shown  in  Figure  3.27. 
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Figure  3.  26  Pin  assignments  and  floor  plan 

of  Large  PLA  chip 


CHAPTER  IV 


From  the  last  chapter,  it  is  obvious  that  it  is  not 
possible  to  have  ripple  AND-OR  circuitry  with  parallel 
devices.  The  alternative  is  to  use  ser i es-paral 1 e 1  or 
paral 1  el -ser i es  combinations.  Transistors  in  series  give 
rise  to  fan-in  problems.  A  way  of  improving  the  fan-in  and 
still  preserving  the  ripple  nature  will  now  be  considered. 
It  will  be  called  a  "moderate*  fan-in  PLA.  The  logic 
diagram  of  the  AND-OR  is  shown  in  Figure  4.1. 


Figure  4  .  1  AND-OR  of  Moderate  PLA 


The  logical  AND  is  implemented  in  two  stages,  NAND 
driving  NOR  to  give  AND. AND  logic.  Suppose  there  are  two 
inputs,  A  and  B,  driving  one  NAND  gate  and  two  inputs,  C 
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and  0,  driving  another  NAND  gate.  The  outputs  of  the  NAND 
gates  will  be  <AB>'  and  (CD)"'  respect  i  vel  y .  The  NOR  o-f 
these  two  is  <  <AB> '+<CD> ' ) '  -  <  <AB) '  >  '  .  <  <CD> -  AB.CD, 
thus  giving  the  AND  function.  The  circuit  schematic  for 
this  is  shown  in  Figure  4.2.  Series  NMOS  transistors  are 
used  to  generate  the  NAND,  and  series  PtiOS  transistors  are 
used  to  generate  the  first  NOR.  During  phasel,  all  the 
input  and  output  nodes  are  precharged  to  their  respective 
standby  states.  The  inputs  of  the  NAND  gates  should  be 
precharged  low  and  conditionally  charged  high.  If  four 
inputs  are  used  per  each  NAND  gate,  four  such  NAND  gates 
provide  the  flexibility  of  using  16  inputs,  whereas  having 
16  transistors  in  series  in  one  stage  is  not  possible.  The 
layout  of  the  AND  plane  with  16  inputs  is  shown  in  Figure 
4.3.  The  Figure  shows  the  technique  used  for  layout 
design.  The  OR  plane  is  implemented  with  NOR  having  NMOS 
transistors  in  parallel.  Its  inputs  are  precharged  low, 
and  its  outputs  are  precharged  high,  as  seen  in  Figure  4.2. 
The  layout  of  the  OR  plane  with  two  impli  cants  and  two 
outputs  is  shown  in  Figure  4.4. 

The  input  latch  shown  in  Figure  4.5  is  used  for  this 
PLA.  The  external  inputs  are  latched  during  phasel,  since 
the  C  switch  is  closed.  In  accordance  with  the  requirment 
of  the  AND  plane  inputs,  the  output  is  precharged  low.  The 
layout  of  input  latch  with  two  inputs  is  shown  in  Figure 


4.6. 


5  6 


Figure  4  .5  Gated  input  circuitry 

Th#  output  latch  shown  in  Figure  4.7  is  the  Nmostly 
latch.  The  dual  version  of  it  was  discussed  in  the 
previous  chapter.  During  phasel,  which  is  the  precharge 
phase  of  the  PLA,  its  input  is  precharged  high.  A  high 
input  turns  the  PMQS  transistor  OFF  and  turns  the  immediate 
NMQS  transistor  ON,  but  the  series  NMOS  transistor  with 
phase2  input  is  OFF.  Therefore  the  output  node  is  floating 
during  the  precharge  phase.  During  evaluation,  the  input 

is  conditionally  discharged  low  and  the  PMOS  transistor 

r 

turns  ON  conditionally.  If  the  input  remains  high,  there 
is  still  a  valid  output  because  phase2  is  high  during 
evaluation  and  thus  the  series  NMOS  transistor  turns  ON. 
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03  through  07  serve  as  the  controller  outputs.  The  cells 
of  this  PLA  are  included  in  the  cell  library  of  Appendix  C. 
The  small  PLA,  with  the  example  mentioned  above  programmed 
in,  is  shown  in  Figure  4.15. 


phase  1 


phase  2  j 

Precharge  PLA  Evaluate 

&  latch  inputs  ANO-OR 


Figure  4  .14  Timing  diogram  of  small  PLA 

4.3  The  chip  -frame 

A  36-pad  -frame  similar  to  that  o-f  the  previous  chapter 
is  used  here.  The  moderate  PLA  and  the  small  PLA  are  both 
•fitted  into  this  -frame.  The  small  PLA  uses  eight  pins  and 
the  moderate  PLA  uses  16  pins.  The  two  clocks  (phasel  and 
phase2>  provided  by  the  -frame  are  used  for  the  moderate 
PLA.  The  clocks  for  the  small  PLA  are  obtained  externally 
by  using  two  input  pads  for  phasel  and  phase2.  Six  pads  of 
the  chip  are  left  unconnected.  The  pin  assignments  are 
illustrated  in  Figure  4.16.  Figure  4.17  shows  the  layout 
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4.2.1  Circui t  Description 

The  AND-OR  for  this  PLA  is  generated  using 
N AND-NAND  logic,  where  parallel  PMOS  transistors  are  used 
for  th*  first  NAND  gats,  and  sari  as  NMOS  transistors  ara 
usad  for  tha  sacond  NANO  gata.  This  circuit  concapt  was 
salactad  bacausa  it  is  conuaniant  for  PPL,  tha  datails  of 
which  ara  axplainad  in  tha  naxt  chaptar.  Tha  same  circuit 
of  PPL  is  chosan  for  this  PLA  so  that  both  these  circuits 
can  je  compared  in  area  and  performance.  Chapter  <JI* 
discusses  such  comparisons. 

Tha  circuit  of  tha  small  PLA  is  illustrated  in  Figure 
4.13.  When  phasal  is  high,  inputs  are  latched  into  the 
input  latches,  and  tha  rest  of  tha  PLA  is  in  tha  standby 
state  or  precharga  state.  Uhen  phase2  (phasal ' )  goes  high, 
the  logic  is  evaluated  in  r i pp l a-through  manner  and  stored 
in  tha  output  latch.  A  detailed  analysis  of  this  circuit 
is  conducted  in  Chaptar  V.  Tha  timing  diagram  for  this  PLA 
is  shown  in  Figure  4.14. 

4.2.2  The  Desion  example 

The  traffic  light  controller  of  Mead  and  Conway 
(4),  tha  truth  table  of  which  is  shown  in  Table  4.3,  is 
programmed  into  this  PLA.  It  has  six  inputs  and  eight 


outputs.  One  of  tha  inputs  as  wall  as  one  of  the  output  is 
unused.  Inputs  13  through  15  art  tha  external  inputs,  and 
inputs  II, 12  ara  tha  feed-back  inputs  from  01,02.  Outputs 
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n#xt  section  ere  pieced  in  a  single  I/O  frame.  The 
description  of  this  frame  is  given  in  Section  4.3. 
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Table  4.2  Moderate  PLA  example 


4.2  Smal 1  PLA 

Close  observation  of  the  moderate  PLA  reveals  that 
logic  is  evaluated  in  three  stages,  NAND-NOR-NOR ,  though  in 
r i pp 1 e-through  fashion  to  give  AND-OR.  A  two-stage  AND-OR 
ripple  PLA  is  then  designed  which  can  be  used  for 
implementing  small  finite-state  machines. 
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Table  4.1  Controller  -for  subroutine  stack  and 
multiplexer  of  microseauencer 


The  AND  plane  is  programmed  in  poly  because  the  input 
lines  o-f  the  AND  plane  are  run  in  first  metal  and  the  OR 
plane  is  programmed  in  active.  The  layouts  of  the  AND 
plane  and  of  the  OR  plane,  with  the  example  of  Table  4.2 
programmed  in,  are  shown  in  Figure  4.11  and  Figure  4.12, 
respec t i ve 1 y .  This  PLA  along  with  the  small  PLA  of  the 


The  timing  associated  with  the  moderate  PLA  is  shown  in 
Figure  4.10.  The  cells  of  the  moderate  PLA  are  documented 
in  Appendix  C  with  their  circuit  schematics  and  layouts. 
The  design  example  o-f  the  next  section  uses  this  library. 
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phase  2  I 


Latch  inputs 
&  precharge 
the  PLA  to 
standby  state 


Evaluate  PLA 
logic  C  AND-OR  ) 


Figure  4  .10  Timing  diagram  of  Moderate  PLA 


4,1.1  The  Moderate  PLA  Desion  example 

The  moderate  PLA  design  has  eight  inputs  and  ten 
outputs.  The  example  loaded  is  a  controller  for  the 
multiplexer  and  the  subroutine  stack  of  a  microsequencer 
<9).  Table  4.1  shows  the  truth  table  of  this  controller. 
Table  4.2  is  derived  by  two  functions:  the  controller 
example  described  above  and  a  worst-case  delay  test.  A 
worst-case  delay  path  is  created  by  giving  logic  "l"  to  all 
the  inputs.  Inputs  II  through  14  and  outputs  01  through  0? 


In  either  case,  a  valid  output  is  obtained  dur i ng 
evaluation.  The  layout  o-f  the  output  latch  with  two  inputs 
is  shown  in  Figure  4.8. 


Figura  4  .7  Dynamic  NMOSTLY  latch 

Figure  4.9  illustrates  the  overall  circuit  o-f  the 
moderate  PLA.  A  two-phase  clock  scheme  is  used.  During 
phasel ,  the  external  inputs  are  latched  and  stored  on  the 
gate  capacitance  o-f  the  input  latch,  and  the  rest  o-f  the 
PLA  is  in  the  standby  state  with  inputs  and  outputs 
precharged  to  their  respective  states.  For  evaluation, 
phase2  goes  high  and  the  logic  is  evaluated  in  a 


r  i  pp 1 e-through  manner  and  stored  in  the  output  latches. 
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PATH  PROGRAMMABLE  LOGIC 

5. 1  Introduct i on 

Path  Programmable  Logic  (PPL)  is  a  variation  of  PLA  in 
which  the  AND  and  the  OR  planes  are  folded  on  top  of  each 
other.  The  logical  OR  is  achieved  by  combining  data  along 
a  single  column,  and  the  logical  AND  is  achieved  by 
combining  data  along  a  single  row.  All  columns  and  rows 
can  be  broken  at  arbitrary  locations  and  with  memory 
elements  distributed  in  the  array,  the  PPL  has  the 
potential  to  have  multiple  independent,  finite-state 
machines  and  data  path  modules  on  a  single  IC  chip.  This 
project  aims  at  designing  PPL  in  CMOS  technology.  The 
circuit  is  represented  symbol i cal  1  y ,  and  a  cell  library  is 
developed  to  help  in  designing  the  layouts. 

3 . 2  PPL  Ci rcu i t 

Ripple  circuitry  is  desirable  for  PPL  to  allow 
multiple  levels  of  logic,  for  example,  the  AND-OR  output 
can  serve  as  another  input  into  the  AND  plane. 

The  AND-OR  for  the  PPL  can  be  implemented  in 
ripple-through  fashion  using  both  NOR-NOR  and  NAND-NAND 
logic.  Uhen  using  NAND  and  NOR,  an  inverter  is  needed 


between  them  to  generate  AND-OR;  this  involves  additional 


circuitry.  The  NOR-NOR  can  be  implemented  either  with 
parallel  N-gates  driving  series  P-gates  (.Figure  5.1a),  or 
with  series  P-gates  driving  parallel  N-gates  (Figure  5.1b). 
AND-QR  with  NOR-NOR  requires  series  P-gates  which  are 
slower;  therefore,  it  will  not  be  discussed  further.  The 
NAND-NAND  can  also  be  implemented  either  with  parallel 
P-gates  driving  series  N-gates  (Figure  5.2a)  or  with  series 
N-gates  driving  parallel  P-gates  (Figure  5.2b).  In  both 
ways  of  implementing  NAND-NAND,  we  have  series  N-gates 
either  in  the  AND  plane  or  in  the  OR  plane.  Minimizing  the 
fan-in  of  the  N-gate  is  very  desirable.  However,  it  is 
difficult  to  judge  whether  there  will  be  a  greater  number 
of  inputs  to  the  AND  plane  or  a  greater  number  of 
implicants  affecting  one  output  in  the  OR  plane,  since  this 
depends  on  the  particular  application.  A  detailed 
discussion  of  this  concept  is  given  in  Chapter  VI  .  From 
the  layout  point  of  view,  the  circuit  with  parallel  P-gates 
driving  series  N-gates  is  used  for  this  project.  It  should 
be  noted  that  PPL  can  also  be  implemented  with  the 
remaining  combinations. 

In  Figure  5.2a,  which  shows  the  circuit  used  for 
implementing  the  AND-OR,  the  AND  plane  outputs  are 
precharged  low  and  conditionally  charged  high.  Its  inputs 
are  precharged  high,  and  any  one  of  the  inputs  going  low 
will  pull  the  AND  output  high.  Similarly,  the  OR  plane 
outputs  are  precharged  high  and  conditionally  discharged 


low.  For  the  OR  output  to  discharge  low,  all  o-f  its  inputs 
should  be  high;  otherwise  the  output  remains  high.  The 
point  o-f  concern  here  is  charge  splitting  (discussed  in 
Section  2.2  o-f  Chapter  ID,  which  might  result  in  the 
output  -falling  to  unacceptable  levels. 


An^l— 1 


phase  1  — I 


.  phase  1 


Figure  5  .3  Improvised  circuit  for  AND-OR 
to  avoid  charge  splitting 


To  overcame  this,  the  improvised  circuit  shown  in 
Figure  5.3  is  used.  Here  the  drain  nodes  o-f  all  the  NMOS 
transistors,  along  with  the  output  node,  are  precharged 


here  as  wall.  For  simplicity,  however,  it  is  not 
considered  in  this  design. 

The  dynamic  Nmostly  latch  of  Figure  5.3  is  used  as  the 
output  latch.  The  operation  of  this  latch  has  already  been 
discussed  in  Chapter  IV.  Many  designers  o-f  state  machines 
•find  set-reset  latches  use-ful  .  There-fore,  a  set-reset 
latch  has  been  generated  using  classic  NANO  gates,  as  is 
illustrated  in  Figure  5.6.  The  NAND  gates  are  used  because 
the  OR  plane  is  precharged  high.  A  high-going-low  signal 
from  the  OR  plane  will  set  or  reset  the  output.  The  timing 
diagram  for  the  PPL  is  illustrated  in  Figure  5.7. 


Figure  5  .5  N-mostly  latch 
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3.3  PPL  Cell  Set 

A  precise  PPL  cell  set  is  developed  with  the  above 
concepts  in  view.  For  the  present,  this  cell  set  can  be 
used  to  implement  finite-state  machines.  As  mentioned 
previously,  lambda-based  scalable  design  rules  documented 
in  Appendix  B  are  used  for  the  design  of  these  cells.  The 
size  of  PPL  cells  is  estimated  in  units;  one  unit  is  equal 
to  eight  by  eight  lambda.  The  reference  point  of  all  the 
cells  is  at  the  bottom  left  corner.  Provision  has  been 
made  to  break  the  rows  when  designing. 

The  PPL  cells  are  mainly  divided  into  six  catagories: 
input  cells,  output  cells,  AND  cells,  OR  cells,  precharge 
cells,  and  interconnect  cells.  The  first  five  of  these  are 
used  for  the  PPL  design  in  symbolic  form,  and  the  remaining 
type  is  used  by  the  designer  for  interconnects.  The  AND 
and  OR  cells  are  considered  to  be  the  heart  of  the  symbolic 
program.  Figure  3.8  illustrates  the  floor  plans  of  AND 
and  OR  cells  in  general.  It  shows  that  the  inputs  and 
outputs  run  vertically  and  that  the  implicants  run 
hor i zontal 1 y .  Therefore,  a  row  line  can  be  broken  to  have 
multiple  implicants  on  a  single  row,  and  similarly  a  column 
line  can  be  broken  to  have  multiple  inputs  and  outputs  on  a 
single  column.  This  feature  will  be  useful  in  minimizing 
the  circuit  area  and  in  undertaking  more  levels  of  AND-OR. 

The  cells  in  the  PPL  cell  set  are: 


Impl icant 


Output  +V 


AND  call  OR  cell 

Figure  5  .8  Floor  plan  of  AND  &  OR  calls 


A)  0 

(zero)  : 
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element  of  the  AND  plane 
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detects 

the  false  state  of 
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input.  Choosing 

the 

•0" 

cell  in 

a  col umn  impl i es 

the 

selection  of 

the 

complement  signal  in  forming  the  impl icant  term  associated 
with  that  respective  row.  The  size  of  this  cell  i s  2  by  2 
uni ts. 


8)  \  (gny)  :  The  combinational  element  of  the  plane 
whichs  detects  the  true  state  of  the  input.  This  cell  is 
used  if  the  true  signal  of  the  input  is  used  in  forming  the 
impl icant.  The  size  of  this  cell  is  2  by  2  units. 


C)  +  <or) 


The  cell  transmitts  the  row  element  on  to  the 


Of?  column.  This  cell  is  used  to  perform  the  OR  function  of 
the  implicants.  The  size  of  this  cell  i s  3  by  2  units. 

D>  S"  -<set/,>  :  The  combinational  element  detects  the 
condition  for  not  setting  the  set-reset  flip-flop.  This 
cell  is  used  to  place  the  implicant,  which  corresponds  to 
the  above  condition,  on  the  OR  column.  The  size  of  this 
cell  i s  4  by  2  un i ts. 

E)  R'  (reset')  :  The  combinational  element  detects  the 
condition  for  not  resetting  the  set-reset  flip-flop.  This 
cell  is  used  to  place  the  implicant,  which  corresponds  to 
the  above  condition,  on  the  OR  column.  The  size  of  this 
cell  i s  4  by  2  un i ts. 

F)  S/R'r  (set '-reset" )  :  The  combinational  element  detects 
the  condition  for  not  setting  and  resetting  the  set-reset 
flip-flop.  This  cell  places  the  implicant  on  the 
respective  OR  columns.  The  size  of  this  cell  i  s  4  by  2 
uni ts. 

G)  IL  _<  i  nout  -1  atch)  s  The  gated  input  latch  (discussed 
in  Section  5.2)  is  used  to  latch  the  input.  The  size  of 


this  cell  is  2.5  by  13  units 
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H)  QL  (output  latch)  :  The  Nmostly  latch  or  gated  D-l atch 
which  can  be  used  to  drive  the  external  logic  and  to  store 
the  "state"  code.  The  size  of  this  cell  is  2.5  by  12 
units. 

I)  FF  (set-reset  latch)  s  A  set-reset  -flip-flop  which  can 
be  used  as  a  memory  element  or  to  generate  the  state 
outputs.  The  size  o-f  this  cell  is  4  by  12. 

J)  PR  _( orecharoe-row)  :  An  NMOS  transistor  used  to 

precharge  a  row.  The  size  o-f  this  cell  i  s  1  by  1  units. 

K>  PC  A  orecharoe-col  umn )  s  A  PMOS  transistor  used  to 
precharge  a  column.  The  size  o-f  this  cell  i  s  1  by  1  units. 

L)  RI  (row-i nterconnect)  :  The  cell  used  to  interconnect 
a  row.  The  size  o-f  this  cell  i  s  2  by  2  units. 

M>  Cl  _< col umn- i n terconnec t )  :  The  cell  used  to 

interconnect  a  column.  The  size  o-f  this  cell  i  s  3  by  2 
uni ts . 

N)  SRI  <set-reset  col umn- i n terconnec t )  s  The  cell  is  used 
to  interconnect  set-reset  column.  The  size  of  this  cell  is 


4  by  2  units 
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Cells  A  through  K  -facilitate  the  design  o-f  a  PPL  in 
symbolic  -form;  however,  the  layout  designer  needs 
additional  cells  L  through  N  -for  interconnects.  Row  breaks 
are  provided  -for  3  cells:  zero,  one,  and  RI  .  The  rotated 
and  mirrored  versions  o-f  the  PPL  cell  set  are  also 
available.  Section  5.6  demonstrates  the  use  o-f  this  PPL 
library  with  an  example. 

5 • 4  Symbolic  Representation 

This  section  illustrates  the  use  o-f  symbols  to 
represent  the  PPL  designs.  An  Exclusive-OR  example  is  used 
in  this  illustration.  The  logic  diagram  and  its  PPL 
symbolic  representation  are  shown  in  Figure  5.9. 

There  are  3  rows  and  3  columns  in  the  program.  The 
input  latches  occupy  columns  1,2  o-f  row  1  -for  latching 
inputs  A  and  B.  The  output  latch  -for  F  is  placed  at  row  1, 
column  3.  A  "0"  -from  input  A  is  detected  by  the  *0"  placed 
in  row  2,  column  1,  and  a  "1*  from  input  B  is  detected  by 
the  "l*  placed  in  row  2,  column  2.  The  "0*  and  *1*  in  row 
2  specify  the  AND  condition,  which  is  transmitted  to  the  OR 
column  by  placing  a  ■+■  in  row  2,  column  3.  The  structure 
of  row  3  can  be  analyzed  using  a  similar  argument.  The  OR 
condition  is  specified  by  the  in  column  3,  rows  2  and 
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CHAPTER  VI 


4CLUSIQN* 


In  this  concluding  chapter,  the  different  PLA  and  PPL 
design  methodologies  described  in  the  preceeding  chapters 
are  compared.  A  discussion  of  future  developments  is  also 
made  . 


In  Chapter  III,  a  "large*  PLA  design  was  described. 
The  parallel  gates  structure  of  the  AND  and  OR  planes  used 
for  this  PLA  is  more  suitable  when  designing  a  controller 
for  large  systems  where  fan-in  is  high,  as,  for  example,  in 
the  controller  of  a  data  path.  The  core  cells  of  the  large 
PLA  have  an  area  of  19  by  19  lambda  in  the  AND  plane,  and 
22  by  19  lambda  in  the  OR  plane.  The  PASCAL  program  of 
Appendix  A  facilitates  programming  the  AND  and  OR  planes  of 
the  PLA.  This  program  is  intended  as  only  a  temporary 
means  of  programming,  however.  In  the  long  run,  a  more 
general  approach  for  PLA  generation,  e.g.,  a  Silicon 
Compiler,  is  needed. 


The  ripple  PLA's  described  in  Chapter  IV  are  not 
useful  for  systems  with  large  numbers  of  i nputs/impl i cants; 
they  can  be  used,  however,  for  systems  with  few  number  of 
i  nputs/impl i cants.  With  a  restriction  in  fan-in,  the 
ripple  PLA's  are  simple  and  faster  and  can  drive  successive 
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The  layout  o-f  the  TLC  has  been  designed  using  the 
symbolic  program  generated.  Since  suitable  software  -for 
automatic  generation  o-f  the  layout  -from  the  symbolic 
program  has  not  been  developed  yet,  the  layout  in  this 
example  has  been  done  manually  on  a  Chromatics  graphic 
computer.  The  Chromatics  user  creates  a  work-sheet  from 
the  symbolic  program  as  shown  in  Figure  5.13.  An  *ma 
following  the  PPL  cell  denotes  that  the  cell  is  mirrored 
and  a  "b"  denotes  a  row  break.  The  placing  of  the  cells 
can  be  better  understood  if  one  goes  through  the  PPL  cell 
set  thoroughly  and  inspects  the  work-sheet  in  Figure  5.13. 
The  layout  of  the  TLC  generated  with  the  aid  of  the 
work-sheet  is  shown  in  Figure  5.14. 

A  20-pad  frame  has  been  used  for  this  PPL  example. 
The  pin  configurations  are  shown  in  Figure  5.15.  Eight 
pins  are  not  used.  The  picture  of  the  final  chip  is  shown 
in  Figure  5.16. 
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terms,  possibly  with  an  increase  in  implicant  terms. 
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in  "0*  state,  and  that  this  implicant  driving  one  of  the 
series  aNa  switches  -forces  the  overall  gate  to  be  OFF,  thus 
■forcing  its  output  to  remain  in  its  precharged  ala  state. 
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for 
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This  problem  can  be  overcome  in  two  ways: 

1>  By  having  series  NMOS  transistors  driving 
parallel  PMOS  transistors.  For  the  selected  input 
combination,  all  the  NMOS  devices  are  ON,  pulling  the 
output  node  low,  which  in  turn  activates  the  parallel  PMOS 
device.  Thus  a  transition  from  the  precharged  *0*  state  to 
ala  is  achieved.  The  core  cell  o-f  this  arrangement  has 
dimensions  o-f  26  by  16  lambda  as  opposed  to  16  by  16  lambda 
as  in  the  previous  arrangement.  There-fore  it  requires 
almost  double  the  area. 

2)  By  generating  set'  and  reset'  at  the  AND-OR 

output. 

Since  the  -former  method  o-f  getting  around  the  problem 
o-f  set  and  reset  is  not  economical,  the  latter  method  has 
been  used.  The  design  can  also  be  simplified  by  using 
Karnaugh  maps.  Wi  th  this  technique  \  more  simplified 
version  is  generated,  often  leading  to  a  decrease  in  ORed 


NAND;  NMOS  transistors  in  ser i as  generate  the  second  NAND . 
This  is  illustrated  with  switches  in  Figure  5.10. 


Figure  5  . 10  AND-OR  functioning 


For  example,  in  the  first  stage,  a  combination  of  101 
opens  all  the  "P"  switches;  then  the  output  of  this  gate 
remains  in  the  precharge  state  of  "0"  .  For  all  remaining 
combinations,  at  least  one  of  the  switches  will  be  closed, 
pulling  the  output  node  to  Bla.  In  the  second  stage,  the 
output  goes  to  ground  only  if  all  the  switches  are  closed 
by  hawing  their  inputs  high;  otherwise  the  output  remains 
in  the  precharged  high  state.  It  is  of  interest  to  note 
that  the  selected  input  combination  will  hawe  its  implicant 
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Figure  5  .9  Symbolic  representation  of  Exclusive-OR 


For  the  conditions  shown  in  the  symbolic  program,  the 
output  is  'i*  i  otherwise  it  is  "0",  which  implies  an 
Excl  usi  ve-OR  -function.  With  this  kind  of  symbolic  program, 
the  user  need  not  worry  about  the  internal  circuitry.  It 
is  also  easy  to  walk  through  the  program  and  make  necessary 
changes.  A  cell  set  which  can  be  used  for  writing  symbolic 
programs  is  developed  in  the  next  section. 


5.5  Design  Methodology 

The  AND-OR  for  the  PPL  is  generated  using  NAND-NAND 
logic.  PMOS  transistors  in  parallel  generate  the  first 


stages  during  the  same  clock  period.  The  "moderate"  PLA  is 
used  as  a  compromise  between  the  "large"  and  the  "small" 
PLA's.  For  this  PLA,  the  -fan-in  can  be  more  than  that  o-f  a 
small  PLA,  but  with  a  sacrifice  in  circuit  area  and  speed. 
It  is  advisable  to  use  such  a  system  if  the  fan-in  needs  to 
be  increased,  thus  preserving  the  r i pp 1 e-through  nature. 
The  core  cells  have  an  area  of  36  by  28  lambda  in  the  AND 
plane,  and  23  by  28  lambda  in  the  OR  plane.  The  "small" 
PLA  is  suitable  for  implementing  small  finite-state 
machines.  The  core  cells  of  this  PLA  have  an  area  of  32  by 
32  lambda  in  the  AND  plane,  and  26  by  32  lambda  in  the  OR 
plane.  Table  6.1  lists  the  dimensions  of  the  AND  and  OR 
planes  of  these  PLA's. 

Table  6.1  is  not  sufficient  for  the  comparison  of  the 
different  PLA's  studied.  In  a  PLA  design,  there  are  input 
latches,  output  latches,  and  precharge  devices  apart  from 
the  AND  and  OR  plane  circuits.  This  periphery,  along  with 
its  interconnects,  plays  an  equally  important  role  in  the 
layout  design.  For  example,  the  OR  plane  dimensions  of  the 
large  PLA  are  limited  by  the  output  latches,  and  similarly, 
the  AND  plane  dimensions  of  the  small  PLA  are  limited  by 
the  OR  plane.  Therefore,  a  more  reasonable  comparison  of 
the  PLA  discussed  is  to  estimate  the  total  area  for 
generating  a  PLA.  The  total  area  for  four  inputs,  eight 
implicants  and  eight  outputs  using  the  different  PLA 
schemes  is  listed  in  Table  6.2.  This  table  indicates  that 
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the  size  of  the  large  PLA  is  smaller  than  that  of  the 
ripple  PLA's.  Every  PLA  structure  discussed  has  its  own 
advantages.  This  project  introduces  a  finite-state  machine 
designer  to  the  different  PLA  structures,  so  that  the  best 
PLA  suitable  for  the  application  can  be  chosen. 

The  PPL  design  described  in  Chapter  Visa  folded  form 
of  the  PLA.  The  circuit  used  for  the  PPL  is  similar  to 
that  of  the  small  PLA  circuit.  Tables  6.1  and  6.2 
illustrate  the  core  cells  area  and  the  total  area  for 
implementing  a  PPL  for  four  inputs,  eight  implicants  and 
eight  outputs,  respect i vel y .  A  symbolic  form  of 
representing  the  design  is  described,  and  a  suitable  cell 
set  is  developed  for  designing  the  PPL.  The  design  example 
in  Chapter  V  also  gives  an  insight  into  the  procedure  of 
designing  the  PPL.  In  order  to  activate  the  set-reset 
latches,  set'  and  reset'  are  required  instead  of  set  and 
reset  outputs  from  the  OR  plane.  This  increases  the  number 
of  series  NMGS  transistors.  To  overcome  this  feature, 
minimization  techniques  are  applied  to  decrease  the  series 
transistor  count.  By  this  procedure,  a  penalty  is  paid  in 
the  number  of  rows  < impl i cants) .  In  the  example  of  Section 
5.6,  the  number  of  rows  increased  from  10  to  13.  This 
concept  of  the  PPL  needs  further  development.  A  possible 


solution  is  to  use  0-latches  instead  of  SR-latches 


The  direct  comparison  of  the  PPL  with  the  PLA  yields 
-four  immediate  advantages: 

1)  The  area  o-f  the  PPL  is  56,472  square  lambda 
against  the  60,300  square  lambda  o-f  the  small  PLA. 

2)  The  row  breaks  and  column  breaks  allow  more 
implicants  on  a  single  row  and  more  outputs  on  a  single 
column.  In  the  PLA,  rows  and  columns  are  dedicated  for 
single  implicants  and  outputs. 

3)  In  the  PPL,  the  inputs  and  outputs  are  alternated 
where  suitable.  This  is  possible  because  of  the  folded 
nature  of  the  AND-OR  planes.  This  capability  of  the  PPL  is 
helpful  for  a  compact  external  routing,  whereas  the  PLA 
lacks  the  capability. 

4)  The  PPL  can  be  expanded  to  handle  functions  like 

<AB+CD)*<A'C+AC'D' )+<EF+A'D>*<AEF'+CDE' >  or 

< ABC+A' EF) *B' CD ,  which  are  difficult  to  handle  using  the 
PLA.  With  this  capability,  the  PPL  can  be  used  for 
realizing  more  generalized  functions  than  the  conventional 
2-level-  AND-OR  functions  handled  by  the  PLA. 

Therefore,  the  PPL  is  definitely  a  more  powerful  tool 
than  the  PLA.  The  PPL  can  also  be  made  useful  for 
implementing  functions  other  than  finite-state  machines. 
Addition  of  storage  cells  at  regular  intervals  in  the  array 
makes  the  PPL  convenient  for  data  path  modules  as  well. 
For  implementing  functions  using  the  PPL,  the  symbolic 
program  will  be  more  convenient  for  the  designer.  This, 


with  the  aid  of  suitable  so-ftware,  can  greatly  reduce  the 
design  effort.  The  PPL  design  methodology  with  so-ftware 
support  has  the  potential  to  rival  semi -custom  designs  in 
the  design  e-f-fort  and  to  provide  densities  more  nearly 
approaching  those  o-f  full-custom  designs.  These  points 
should  be  considered  in  the  future  development  of  the  PPL. 


Large 

Moderate 

Smal  1 

PPL 

PLA 

PLA 

PLA 

Dimensions  of 
(in  1 ambda) 

AND  cell  : 

19  x  19 

36  x  28 

32  x  32 

32  x  32 

OR  cel  1  : 

22  x  19 

23  x  28 

26  x  32 

48  x  32 

Table  6.1  Comparison  of  AND-OR  planes  of  different  PLA' 


Large  PLA 
Moderate  PLA 
Small  PLa 
PPL 


Area  in 
lambda  square 

57,928 
61 ,780 
60,300 
56 , 472 


Table  6.2  Circuit  area  ■for  4-inputs  6c  8-outputs  using 


differnt  PLA's 


J  ■  'i  ■  L'  *L 


.•/j 
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c 

PLA  PROGRAM 
By 

NAINI 


Input:  The  input  to  the  program  is  the  truth  table  of  the 
function  to  be  programmed  into  the  PLA.  The  user  also  need 
to  select  the  input  partitioning  option. 

Output:  The  program  generates  a  MSLL  (Mississippi  State 
Layout  Language  )  file  which  places  transistors  in  the  AND 
plane, OR  plane  and  the  input  circuit  of  the  PLA. 

> 

PROGRAM  Main  (Input,  Output,  Andarray,  Orarray)  ; 


CONST 


A  Cl 

*  '  acl 

AC2 

»  '  ac2 

AC3 

*  '  ac3 

AC4 

=  *  ac4 

0C1 

a  /  OC  1 

0C2 

a  '  OC2 

0C3 

»  'OC3 

0C4 

a  7  OC4 

OR  ROW  »  28 

OR  COL  »  28 

AND  ROU  a  28 

AND 

COL  a  64 

TYPE 

Str5  *  Packed  Array  C 

Str4  =  Packed  Array  E 

Str3  »  Packed  Array  E 

N_Type  »  Array  El.. 

PLA_r ec  *  Record 

Row, 
Col umn 
Cel  1 
End  ; 

And_Type  =  Array  E  1 

1 

Or_Type  *  Array  E  1 

1 


1  . .  5  ]  of  Char  ; 

1  . .  4  3  of  Char  ; 

1  . .  3  ]  of  Char  ; 

64  3  of  Str4  ; 


:  Real  ; 

:  Str3  ; 

. .  AND_R0U, 

.  .  AND_C0L  3  of  PLA_r  e  c 
. .  0R_R0W , 

. .  OR  COL  3  of  PLA  rec 


VAR 


x  * 
y 

And_Pl ane 
0r_Pl ane 
N  cel  1 


Ch 


i  » 

j  i 

AA_0utput 


:  Integer 
:  And_Type 
:  0r_Type 
:  N_Type 
:  Char 


:  Integer 


*‘.1 

wV< 
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A_Input , 

B_Input 
An  dam  ay, 

Or  am  ay 

PROCEDURE  Val idationcell  <VAR  N_ce11  :  N_Type 

VAR  Nor_in, 

a » 
b, 

c  :  Str5  ; 
one , 

/  two, 

three  :  Str4  ; 
x  :  Integer  >  ; 

BEGIN 

If  Nor_in  *  a  Then 
N_c e 1 1 C x ]  : «  one 

El  se 

If  Nor_i  n  =*  b  Then 
N_ce1 1  Cx]  :=*  two 

Else 

If  Nor_ i  n  *  c  Then 

N_ce11Cx3  s=  three  ; 

Write  In  <'c  N__ce11£x3  s  4,  ",  ', 

<-613  +  <<x-l>  Div  4>  *  38 >  s  0, 

-112,0,1,0,1,0  ')  ; 

END  C  val i dat i once  11  >  ; 

PROCEDURE  Read_data  <VAR  N_cell  :  N_Type>  ; 
VAR 

I  > 
j  » 

x  :  Integer  ; 

Ch  :  Char  ; 

Nor__  i  n , 

*» 

b, 

c  :  Str5  ; 

one , 

two, 

three  :  Str4  ; 

BEGIN 

For  i  :■  1  to  16  do 
Begi  n 

For  j  :*  1  to  4  do 
Begi  n 

Read  <Nor_in,  Ch)  ; 
x  :*  <  i  -1  >  *  4  ♦  j  ; 

Case  j  of 
4  : 

Begin 

a  'NA.B  '  ; 


:  Char 
s  Text 
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b  :=*  'NA  '  i 
c  '  B  '  j 

one  :=  'nlO  ' 
two  s*  'nil  ' 
three  s*  'nl2 


Begin 

a  :«  'NA.NB' 
b  :»  'NA 
c  :*  'NB 
one  :=  'n7  ' 

two  :»  'n8  ' 

three  :»  'n9 
End  ; 


Begin 
a  : 
b  : 


a  :»  ' A.B  ' 
b  :=  'A 
c  :*  'B 
one  s*  'n4  ' 

two  :=  'n5  ' 

three  s*  'n6 


Begin 

a  s-  'A.NB  ' 
b  'A 

c  'NB 

one  :»  'nl 
two  :»  'n2 
three  !=»  'r>3 
End  : 


Validationcell  <N_cel  1  ,  Nor_in,  a,  b,  c, 

one,  two,  three,  x>  ; 

End  ;  <  -for  j  > 

Readln  ; 

End  ;  C  for  i  > 

End  ;  <  Read_data  > 

PROCEDURE  Wr i te_l i ne  ; 

BEGIN 

Ur i te 1 n  <  ' - '  > 

END  ; 

PROCEDURE  In i t i al i ze_and  ; 


BEGIN  <  ini 
For  i 


:  Integer  f 
l  To  AND  ROW  DO 


•*■**  •  •. 


a  .*•  ** *»  ^  . 
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For  j  :»  1  To  AND_C0L  DO 

with  and_plane  C  i , j  1  DO 
cell  :*  'xxx' ; 

END  ;  C  ini  } 

PROCEDURE  Initialize  or  ; 

i 

In  this  procedure  the  0R_plane  is  initialized  to  xxx 

> 

VAR 

1  t 

j  :  Integer  ; 

BEGIN  <  ini  > 

For  i  i»  1  To  0R_R0U  DO 

For  j  :»  1  To  0R_C0L  DO 

with  or_plane  t  i , j  3  DO 
cell  •**  *  xxx  '  ; 

END  ;  C  ini  > 

PROCEDURE  Print  and  ; 

£ 

This  procedure  print 

} 

VAR 

■  » 

j  :  Integer  ; 

BEGIN  £  print  > 

write  < '  '  s  6 ) 

For  j  :»  AND_COL  Down to  1  DO 
write  <'  '  :  4,  j  !  2> ; 
wr  i  tel n  ; 

For  i  :■  AND_R0tJ  Down  to  1  DO 
Begi  n 

write  < '  '  s  4,  i  :  3,  '  '  :  2); 

For  j  s«  AND_C0L  Down  to  1  DO 

write  <and_pl aneC i , j 3 .cel  1  :  4, 

wr i tel n  ; 

End  ; 

END  ;<  print  ) 

PROCEDURE  Pr i n  t_or  ; 

VAR 

•  » 

j  :  Integer  ; 

BEGIN  <  print  ) 

write  < '  '  :  6) ; 

For  j  ■»  1  To  0R_C0L  DO 

wr  •e<‘''  :4,j  :2)  ; 
wr i tel n  ; 

For  i  :*  0R__R0W  Down  to  1  DO 
Begi  n 

write  ('  '  j  4,  i  :  3,  '  '  :  2)  ; 

For  j  i»  1  To  0R_C0L  DO 

write  <or_p l aneC i , j 3 . ce 11  s  4, 

'  '  s  2  )  ; 

wr i t  e 1 n  ; 

End  ; 


no 


END  5  {  print  > 

PROCEDURE  pr i nt_and_coordi nates; 

VAR 

•  » 

j  :  Integer  ; 

row, 

column  :  Real  ; 

BEGIN 

For  i  :■  1  To  AND_ROUJ  DO 
Begi  n 

For  j  :»  1  To  ^D_COL  DO 
Begi  n 

with  and __p1aneC  i ,j  ]  DO 
Begi  n 

If  cell  <>  'xxx'  Then 
Beg  i  n 

wr  i tel n  ('c  cell  :  2, 

' ,  ' ,  C  “623  +  <<  j-l> 

Diw  2)  *  19)  :  0,  ( 

<230  -  <  < i-l>  Di  v  2) 

*  1 9>  >  :0,' ,0,1  ,0,1  ,0')  ; 
End  ; 

End  ; 

End  ; 

End  ; 

END  ;(  pr i n  t_and_coordi nates  > 

PROCEDURE  print  or  coordinates  ; 

VAR 


j  :  Integer  ; 

rowl  , 

columnl  :  Real  ; 

BEGIN 

For  i  :■  l  To  OR_ROW  DO 
Begi  n 

For  j  :■  1  To  0R_C0L  DO 

Begi  n 

with  or__planeC  i  ,j  ]  DO 

Begin 

If  cell  <>  'xxx'  Then 

writeln  ( '  c  cell  :  2, 
'  '  ,  <  35  +  <<  j  - 
1)  Di v  2)  *  22)  :  0,  '  , 

<  -3  +  <  <  i  -  1 )  Di v  2) 

*  19)  :0 , '  ,  0 , 1  ,0 , 1 ,0' >  ; 

End  ; 

End  ; 

End  ; 

END  ; Cpr i n t_or_coordi nates) 


PROCEDURE  And_Logi c  <VAR 


j  :  Integer  ; 
a_ i n  p  u  t , 


V  V 


Ill 


BEGIN 


b_input  :  Char  )  ; 


Case  <j  Mod  4)  of 
1  : 

If  <<a_in pot  »  '0'> 

And 

<b_input  =*  "l') 

And 

<n_ce11C  x  3  *  'nl  ')> 

Or 

<<a_input  =  '0') 

And 

<  n_ce 1 1 [  x  3  »  ' n2  ' >  > 

Or 

<<b__input  =  '  1 '  ) 

And 

<n_cell[  x  ]  =  '  n3  '))  Then 
Begin  C  ab'  > 

With  and_planeC  i,j  ]  Do 
Begi  n 

If  ( i  Mod  2)  *  1  Then 
cell  ' ac4  ' 


El  se 


cell  :=*  'ac2  '  ; 
row  :=*  i  ; 
col umn  :**  j ; 

End  ; 

End  ;(  ab'  > 

2  : 

If  <<a_input  *  '0') 

And 

<b_input  *  '0') 

And 

<n_cel 1 C  x  ]  ■  'n4  ')> 

Or 

<<a_input  *  '0") 

And 

<n_ce11C  x  1  a  'n5  ')) 

Or 

<<b_input  *  '0") 

And 

<n_cell[  x  ]  -  'n6  '))  Then 

Begin  Ca'b'} 

With  and_planeC  i ,j  3  Do 
Begi  n 

If  (i  Mod  2)  *  1  Then 
cel l  :=  ' ac3  ' 


Else 


cel  1  :*  'acl  ' ; 
row  :  =  i  ; 
col  umn  !=*  j  ; 

End  ; 

End  ;<  a'  b'  ) 

3  : 


.-'ll 

■:>- 


i 

« 
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If  <  <a_i nput  *  '1') 
And 

<b_input  =»  '  1 '  > 
And 


<n 

cel  1 C 

x  3  = 

'  n7 

Or 

<  <  a. 

_i  nput 

-  '1' 

) 

And 

<n 

cel  1  C 

XIs* 

'n8 

')> 

Or 

<<b 

_i  nput 

-  '1' 

) 

And 

( n_ 

cel  1  C 

x  3  - 

'n9 

Then 

i  n  C 

ab  > 

Wi  th  and__p  1  ane 

C  i 

,J  3 

Do 

Beg 

i  n 

If 

<  i  Mod 

2) 

=  1 

Then 

cell  : 

SB  ' 

ac4  ' 

Else 

cel  1  : 

=  / 

ac2 

'  . 

% 

row  : 

«  » ; 

col umn  :=  j ; 

End  ; 

End  ;{  ab  } 

If  < (a_i  nput  *  'l') 

And 

<b_i  nput  *  '0'> 

And 

<n_cel U  x  3  -  'nlO  ')) 

Or” 

<<a_input  *  '  1  '  > 

And 

<n_cel  1C  x  3  ■  "nil  ')) 

Or 

<<b_input  *  '0') 

And 

<n_cel  1C  x  3  »  "nl2  ')>  Then 
Begin  <  a'b  > 

With  And_planeC  i ,j  3  Do 
Begi  n 


If 

< i  Mod  2) 

=  1  Then 

cell  :=  ' 

ac3  ' 

Else 

cell  ' 

ac  l  '  ; 

row 

*■  • ; 

col umn  :*  j ; 

End  ; 

End  ; <a'  b) 
End  ;<*  case  *) 

END  ;<  Logic  > 

PROCEDURE  Or_Logic  (VAR 


> 


J  * 

aa_output  :  Integer 

BEGIN 

Case  <  i  Mod2>  Of 
1  : 

If  aa_output  *  1  Then 

With  or_planeC  i , j  ]  do 
Beg  i  n 

I-f  <j  Mod  2)  =*  0  Then 
cell  :*  ' oc4' 

El  se 

cell  :*  '  oc3  ' ; 

End  ; 

0  : 

I-f  aa_ouput  *  1  Then 
With  or _planet  i , j  3  do 
Segi  n 

I-f  (j  Mod  2)  =*  0  Then 
cell  :=  ' oc2' 

El  se 

cel  1  :*  'ocl' ; 

End  ; 

End  ; Cease) 

END  ; 

BEGIN  <*  main  *) 

Read_data  <n_ cell); 

Wr i te_l  i ne  ; 

Initial i ze_and; 

Wr  i  te__l  i  ne  ; 

For  i  :»  I  To  AND_ROU  do 
Begin  C  i  ) 

For  j  1  To  ANO_COL  do 
Begin  (  j  ) 
x  :»  J  i 

If  <  <  j  —  i >  Mod  4)  »  0  then 
Begi  n 

Read  <a_input,  ch,  b_input,  ch); 
End  ; 

And_logic  <i,  j,  a_input,  b_input); 
End  ;  C  j  ) 

Re ad In  ; 

End  C  i  ) 

Wr  i  te 1 n  ; 

Wr i te 1 n  ; 

Initial i ze_or  ; 

Write  1 i ne  ; 

Writeln  ('call  cell  for  AND_array  '); 

Wr i te_l i ne  ; 

Pr i nt_And_coordi nates; 

Wr i te_l i ne  ; 

Writeln  ; 

For  i  !■  1  To  OR  ROW  do 


LAMBDA 


9.1  VIA  SIZE .  2X2 

9.2  VIA  SEPERATION .  2 

9.3  METAL  2  OVERLAP .  1 

9.4  SPACE  TO  POLY  OR  ACTIVE  EDGE .  2 

9.5  SPACE  TO  CONTACT .  2 
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B ■ 8  METAL  1 


LAMBDA 


8.1  MINIMUM  UIDTH  .  3 

8.2  MINIMUM  SPACE  .  3 

8.3  MINIMUM  POLY  CONTACT  OVERLAP .  1 


8.4  MINIMUM  ACTIVE  CONTACT  OVERLAP 
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7.1  CONTACT  TO  ACTIVE 

LAMBDA 


7.1  CONTACT  SIZE .  2X2 

7.2  ACTIVE  OVERLAP  .  1 

7.3  SPACING  ON  &*1E  ACTIVE .  2 

7.4  SPACING  ON  DIFFERENT  ACTIVE .  6 

7.5  SPACING  TO  DIFFERENT  ACTIVE .  5 

7.6  SPACE  TO  GATE .  2 


7.7  SPACE  TO  POLY  OVER  FIELD,  SHORT  RUN.  ...  2 

7.8  SPACE  TO  POLY  OVER  FIELD,  LONG  RUN  ....  3 


— ^7.7^ — 


B.6  CONTACT  TO  POLY 

LAMBDA 

CONTACT  SIZE .  2X2 

POLY  OVERLAP  OF  CONTACT .  1 

SPACING  ON  SAME  POLY .  2 

SPACING  ON  DIFFERENT  POLY .  5 

SPACE  TO  OTHER  POLY .  4 

SPACE  TO  ACTIVE;  SHORT  RUN .  2 

SPACE  TO  ACTIVE,  LONG  RUN .  3 


“H  6,5 


B.5  PSELECT 

LAMBDA 

PSELECT  SPACE  (OVERLAP)  TO  (OF)  GATE.  ...  3 
PSELECT  SPACE  (OVERLAP)  TO  (OF)  ACTIVE.  .  .  2 
PSELECT  SPACE  (OVERLAP)  TO 


(OF)  CONTACT  TO  SUBSTRATE 


1 


LAMBDA 


MINIMUM  WIDTH  ......  .  2 

MINIMUM  SPACE  .  2 

GATE  OVERLAP  OF  ACTIVE .  2 

ACTIVE  OVERLAP  OF  GATE .  2 

PI  ELD  POLY  TO  ACTIVE .  1 


121 


B . 3  ACTIVE 


LAMBDA 


3.1  MINIMUM  WIDTH  .  2 

3.2  MINIMUM  SPACE  .  3 

3.3  SOURCE/DRAU  ACTIVE  TO  WELL  EDGE .  6 


3.4  SUBSTRATE  CONTACT  ACTIVE  OVERLAP 
OF  SUBSTRATE  . 


3.2  P-UELL  (N-WELL) 


LAMBDA 


WELL  WIDTH . 6 

WELL  SPACE . 6 


h*-2.2 


11? 


B.l  LEGEND 


P  well 


P  select 


Active 


Polysilicon 


Contact  to  poly 


Contact  to  active 


First  metal 


Via 


Second  metal 
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The  more  debatable  rule  is  the  use  o-f  stacked  vias. 
The  contact  -from  second  metal  to  active  with  a  stacked  via 
is  illustrated  in  B-l  1 .  Also  shown  are  other  versions  o-f 
designs  which  abide  by  the  rules.  Since  control  lines  are 
most  o-f  ten  implemented  in  the  -first  metal  in  order  to 
connect  to  poly  gates  conveniently,  the  second  metal  o-ften 
must  connect  to  active  -for  data  lines  and  supply  lines.  A 
possible  compromise  on  the  rules  would  be  to  requ i re  only  a 
one  lambda  active  overlap  o-f  the  via,  taking  into  account 
the  less  steep  edge  o-f  the  active  due  to  the  commonly  used 
selective  oxidation  processing.  This  would  not  require 
analyzing  the  active  pad.  With  this  compromise,  the  PPL 
core  cell  is  increased  approx imatel y  10X  over  one  with  the 
stacked  via. 

Thus  the  stacked  via  is  most  convenient  and  results  in 


smaller  layouts.  MOSIS  needs  to  support  the  stacked  via. 


The  lambda  based  scalable  design  rules  presented  here 
have  been  used  for  this  project.  Lambda  is  equal  to  1.3 
microns  for  3  micron  technology,  and  0.8  microns  for  1.2 
micron  technology.  A  few  exceptions  have  been  made, 
however,  because  the  project  started  well  before  the  rules 
came  into  effect.  The  exceptions  are: 

1)  The  spacing  between  two  different  active  regions 
is  three  lambda  instead  of  four  lambda. 

2)  The  first  metal  minimum  width  is  two  lambda 
instead  of  three  lambda. 

3)  Stacked  v i as  <via  over  a  contact)  are  used  for 
the  designs. 

The  first  exception  can  be  accommodated  without  much 
loss  in  the  area,  but  the  next  two  need  consideration. 
Since  the  first  metal  is  used  for  local  interconnect,  and 
in  most  designs  limits  the  density  of  the  layout,  it  is 
very  desirable  to  have  a  two  lambda  minimum  width. 
Silicon-gate  CMOS  without  buried  contacts  places  an 
additional  interconnect  burden  on  the  first  metal.  Even 
using  a  two  lambda  first  metal  minimum  width  with  the 
scalable  rules,  the  first  metal  minimum  pitch  is  7.3 
microns  for  the  3-micron  process,  compared  with  the  MOSIS 
standard  of  7  microns.  So  a  first  metal  minimum  width  of 


two  lambda  is  a  reasonable  compromise. 


Design  rule  issues  .  .  . 

B-l  Legend.  . 

B-2  P-we 11 . 

B-3  Active . 

B-4  Polysilicon  .  .  . 

3-5  P-select.  .  .  .  . 

B-6  Contact  to  Poly  . 

B-7  Contact  to  Active 
B-8  First  Metal  .  .  . 

B— 9  Via  . 

B-10  Second  Metal .  .  . 

B-ll  Glass  . 

B-12  Via  contacts.  .  . 


Begin 

Read  <aa_output); 

Or_logic  <i,  j,  aa_ output) 
End  ; 

End  ; 

Ur i te_1 i ne  j 

Unite  In  ('call  cells  -for  0r— array 
Ur  i  te_J  i  ne  ; 

Ur i t  e 1 n  ; 

Pr i nt_or_coordi nates; 

Ur i tel n  ; 
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B  ■  1 0  METAL  2 


LAMBDA 


10.1  MINIMUM  WIDTH .  3 

10.2  MINIMUM  SPACE  .  4 

10.3  MINIMUM  VIA  OVERLAP .  1 


LAMBDA 


11.1  MINIMUM  BONDING  PAD . 100  x  100 

11.2  MINIMUM  PROBE  PAD . 73  x  73 

11.3  PAD  TO  GLASS  EDGE .  5 


Stacked  contact 


} 


Via  outside  active 


i 
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